THE APPLICATION OF THE THEORY OF PHYSICAL
MEASUREMENT TO THE MEASUREMENT OF 
PSYCHOLOGICAL MAGNITUDES, WITH 
THREE EXPERIMENTAL EXAMPLES 


Part I 


INTRODUCTION 


THE PROBLEM 

When an observer in a discrimination
experiment makes a differential response
to a stimulus, he is responding
with respect to some discriminable char- 
acteristic. This characteristic may be de- 
fined in general terms as the combined 
effect upon discriminatory behavior of 
the experimenter’s operations of stimulation
and instruction. Discriminable characteristics,
so defined, are “subjective”
only in the sense that they are clearly 
distinguished from the stimulus-corre- 
lates. They are not identified with a 
private, immediate experience. 

Every stimulus to which the observer 
can make a verbal response must have 
given rise to at least one discriminable 
characteristic. However it is probable 
that every stimulus produces more than 
one. For example, an auditory stimulus 
produces such discriminable character- 
istics as pitch and loudness; a visual 
stimulus produces such characteristics as 
hue, brilliance and saturation. 

With respect to many discriminable 
characteristics the observer will be able 
to make verbal responses of “greater 
than,” “less than,” “not greater than” and
“not less than.” Those discriminable characteristics
to which this type of verbal
response can be made may be said to exist 
in discriminable degrees. 

The purpose of this study is to examine 
the possibility of measuring those discriminable
characteristics that exist in
discriminable degrees. 


Many discriminable characteristics
have known physical correlates. For example,
the chief physical correlate of the
discriminable characteristic of weight is 
the “physical weight” of the object. The 
measurement of “physical weight,” in 
fact the measurement of all physical cor- 
relates, is a problem for the physicist, but 
the measurement of the so-called subjec- 
tive magnitudes is a problem for the 
psychologist. This statement assumes, of
course, that there is a difference between
physical and subjective magnitudes.
Although this difference seems obvious 
from the common sense point of view, it 
is not too easy to frame a monistic opera- 
tional definition of the two types of 
magnitudes. 

A discriminable characteristic has been 
defined as the combined effect on discriminatory
behavior of the experimenter’s
operations of stimulation and
instruction. The instructions include the 
directions to make a judgment of “greater 
than,” “less than,” “not greater than” 
and “not less than.” The discriminatory 
behavior includes four different re- 
sponses, one for each of these judgments. 
If for given conditions of stimulation and 
under the appropriate instructions, the 
observer makes these responses so as to 
meet some arbitrarily predetermined 
standard of consistency, it is said that he 
is able to discriminate degrees of the 
characteristic. But this discriminatory re- 
sponse is fundamental to both physical 
and subjective magnitudes. The two cannot
be differentiated in terms of this basic
response.

If there is a difference it must be found 
in differences in the operations of stimula- 
tion or in the remaining operations of 
instruction.

There does seem to be one difference 
between the procedure adopted by the 
physicist and that adopted by the psy- 
chologist. Having identified a character- 
istic, the physicist in the interest of 
consistency and greater discriminatory 
power, usually abandons it for another
characteristic which is correlated with the 
first one. For example, the characteristic 
of subjective weight may be identified by 
a series of operations which involve, 
among others, the operation of hefting. 
The physicist will find that he is able to
construct a magnitude that correlates 
with the original subjective magnitude 
but which substitutes operations involv- 
ing balances for the operation of hefting. 
Furthermore the discriminable charac- 
teristic is changed from subjective weight 
to some spatial characteristic such as the 
position of a pointer. 

In many cases the change of character- 
istic will be a good deal less obvious than 
it was in the case above. The difference 
between the operations for scaling subjec- 
tive and physical length is a case in point. 
Even here examination shows that the 
operations adopted by the physicist entail 
a change of discriminable characteristic. 
The scaling of subjective length will entail
a judgment of the overall length of
the stimuli. But the physicist will place 
the two stimuli side by side and give in- 
structions that demand that the observer 
disregard the overall length and make a 
judgment concerning the presence or ab- 
sence of a difference. 

By this means the physicist is able to 
extend the scale beyond those limits imposed
by the low discriminatory capacity





THOMAS WHELAN REESE 


of the observer with respect to the origi- 
nal discriminable characteristic. He is 
able to define magnitudes above the up- 
per limit for the original characteristic 
and below its absolute threshold. He is 
also able to increase differential sensi- 
tivity. 

Although the psychologist may change
his instructions and alter the conditions 
of stimulation in the interest of consis- 
tency and greater discriminatory capacity, 
unlike the physicist, he cannot change 
the discriminable characteristic. 

There are not only a large number of 
discriminable characteristics, but also a 
large number of ways in which these 
characteristics may differ from one 
another. But characteristics not only differ
from one another; many characteristics
are similar to one another in respect of 
certain aspects they have in common. 
For example, hue, brilliance and satura- 
tion are all mediated by the same sensory 
mechanism. These features, by means of 
which discriminable characteristics may 
be described and classified, will be called, 
for the purpose of this discussion, the as- 
pects of discriminable characteristics. Cer- 
tain discriminable characteristics may be 
classified as qualitative and others as in- 
tensitive (Boring, 4); quality and inten- 
sity may be said to be aspects of dis- 
criminable characteristics. Auditory pitch 
has an aspect of quality and auditory 
loudness an aspect of intensity. 

It may be possible to find an aspect 
that makes it difficult to measure all the 
discriminable characteristics possessing it, 
regardless of any other aspects which the 
characteristics may or may not have in 
common. Likewise, aspects may be found 
which permit the characteristic to be 
measured easily. 

Now, it is obvious that it is impossible 
to prove that every discriminable characteristic
is measurable unless every characteristic
is actually measured successfully.
Only a probable inference can be
drawn from partial evidence. The validity 
of the inference will depend, in great 
part, upon the adequacy of the sample. 
Adequacy may refer simply to the num- 
ber of cases that are selected at random. 
By this definition an adequate sample is 
one that contains the number of cases that 
allows reasonable probability that the 
distribution of the aspects is approxi- 
mately the same in the sample as in the 
population being sampled. There is no 
criterion for adequacy defined in this 
way. All one is able to say is that the 
greater the number of cases, the greater 
is the probability that the sample is 
adequate. 

For the purpose of this study there is 
another way to look at adequacy. If the 
presence of a certain aspect leads to the 
belief that the characteristic having it 
will be difficult to measure, then it is 
possible purposely to select those dis- 
criminable characteristics which have 
those aspects that offer the least chance 
for successful measurement. If, then, they 
are measured successfully, it may be argued
that the probability for the successful
measurement of all discriminable
characteristics is increased. The validity 
of this procedure may be further in- 
creased by selecting several different dif- 
ficult aspects. By this method of selection 
of the characteristics to be measured, both 
conceptions of an adequate sample are 
combined. 

The first thing, then, that needs to be 
done is to classify the aspects of discriminable
characteristics and to choose several
of them according to the above principles.
There are at least four categories of 
classification that might be important: 
1) The sense modality. 
2) The aspects of “conscious dimensions”
as described by Boring (4): a) the
qualitative dimension as exemplified by
auditory pitch; b) the intensitive dimension
as exemplified by loudness; c) the
extensitive dimension as exemplified by
length or volume and d) the protensitive
dimension as exemplified by time.

3) The characteristics may have the 
aspect of being palpable or impalpable. 
Impalpable is Titchener’s translation of 
unanschaulich, the adjective given to 
Ach’s Bewusstheit (awareness). The char- 
acteristic may be a “vague, intangible 
conscious content that is not image or 
sensation,” to quote Boring’s (3) descrip- 
tion of Bewusstheit. 

The difference between palpable and 
impalpable may be exemplified by com- 
paring auditory pitch and the experi- 
enced difficulty of doing a mental test 
item. 

The pitch characteristic of a tone is not
impalpable. Even if it fades and becomes 
intangible and vague, the nature of the 
experience is such that it may be recap- 
tured on a second presentation of the 
stimulus. But the experienced difficulty 
of doing a mental test item is of an en- 
tirely different order. Here the original 
impression may be relatively impalpable 
and when once lost can never be regained, 
for it is extremely unlikely that the ex- 
perienced difficulty will be the same on a 
second presentation of the same item. 

4) Stimulus correlation. The discrimi- 
nable characteristic may be known to be 
correlated with one or several aspects of 
the stimulus, as, for example, pitch is correlated
with frequency and intensity,
loudness with intensity and frequency. 
On the other hand the correlated stimulus 
aspect may be unknown or exceedingly 
complex. 

Several subjective scales have been con- 
structed or are in the process of construc- 
tion. Stevens (38) has constructed a scale 
for loudness. Stevens, Volkmann and 
Newman (39) have constructed a scale
for pitch, which was later revised by
Stevens and Volkmann (42). Taves (44) 
has constructed a scale for visual numer- 
ousness (perceived number) and Taback 
(43) for perceived weight. 

Thus there are magnitude functions 
that may be classified under three sense 
modalities, vision (numerousness), audi- 
tion (pitch and loudness) and kinaesthe- 
sis (weight). They may be classified under 
three conscious dimensions, qualitative 
(pitch), intensitive (loudness and weight) 
and extensitive (visual numerousness). 
The characteristics are relatively palpa- 
ble and have known stimulus correlates. 
The stimulus correlates of pitch are fre- 
quency and intensity, of loudness are 
intensity and frequency; while the cor- 
relate of numerousness is the actual num- 
ber of stimulus objects, and the correlate 
of perceived weight is the physical 
weight. 

The three discriminable characteris- 
tics with which this experiment will deal 
are: 1) visual rate, or the perceived rate 
of the flash of a lamp, 2) the experienced 
difficulty of items in a memory span test 
(digits) and 3) the experienced difficulty 
of multiple choice items in a vocabulary 
test. 

It will be seen that these character- 
istics differ not only from those that have 
already been scaled but differ radically 
from each other. The chief differences
are in respect of palpability and stimulus
correlation. Visual rate has the aspect of
palpability and of a known stimulus 
correlate, i.e., the actual rate of the flash 
of the lamp. The experienced difficulty 
of items in a memory span test is rela- 
tively impalpable and the important 
stimulus correlate is known to be the 
number of digits in the series. In the difficulty
of words in the multiple choice
vocabulary test the characteristic is impalpable
and the stimulus correlate is
unknown. It is, for example, impossible 
to say that it is the number of letters in 
the word or its length in inches. 

The “physical dimensions” of these 
characteristics would seem to be differ- 
ent. Visual rate is protensitive and al- 
though it would be difficult to fit the 
physical dimension associated with ex- 
perienced difficulty into Boring’s classi- 
fication, at least it does not seem to be 
solely protensitive. 

The important differences for this 
study are those of palpability and stimu- 
lus correlation. Impalpability presents 
some difficulties, as a characteristic that
is intangible and fleeting will be more 
difficult for the subject to judge than 
one that is not, and the impossibility of 
recovering the experience will present 
some technical difficulties in designing 
the experiment. The lack of a stimulus 
correlate will also present some difficulty. 
In fact Guilford (21) has said that the 
complete psychophysical treatment of 
mental test data is impossible because of 
the lack of a physical evaluation of the 
stimulus. 

Guilford emphasizes the importance of 
this problem of the physical correlate 
because it seems to be the chief factor 
in obscuring the common ground be- 
tween psychophysical and mental test 
problems (22). He has found the relation 
between the psychological difficulty of a 
test item and a corresponding physical
evaluation of the items, using as his definition
of difficulty “percentage failing”
and using as his test items certain of the
Seashore tests.

There are in reality two problems 
here. One is the measurement of the 
discriminable characteristic and the other
is the relation between the physical and
the subjective magnitudes. So far as
measurement is concerned the problem
of the stimulus correlate solves itself. It
will be shown later in this study that a 
physical evaluation of a correlated stim- 
ulus variable is unnecessary for the 
measurement of a subjective magnitude, 
provided only that one has some means
by which stimuli of different subjective
magnitudes may be physically identified. 
So far as the relation between the two 
variables is concerned, naturally it will 
be impossible to demonstrate a relation 
between some variable and a stimulus 
correlate if the stimulus correlate cannot 
itself be identified or measured. 
Part II


THE LOGIC OF MENTAL MEASUREMENT


SECTION A. INTRODUCTORY 


Recently a number of physicists (and
logicians) together with a group of
psychologists have leveled a particularly 
vigorous attack against the theoretical 
concepts upon which psychologists have 
based their practice of measurement. 
The criticism has come to a head with
the recently published Final Report of
the Committee appointed to consider
and report upon the possibility of Quantitative
Estimates of Sensory Events (14).
The members of this committee were
drawn from Sections A (Physics) and J
(Psychology) of the British Association
for the Advancement of Science.

In the following sections the criticisms 
of both the physicists and the psycholo- 
gists will be examined in some detail. 
Suffice it to say here that the physicists 
have claimed that measurement in any 
true sense is impossible in psychology. 
They base this conclusion on what they
consider to be the fact that none of the
attempts at measurement in psychology
meet the necessary logical requirements
for fundamental measurement.

They argue that psychologists must 
then do one of two things. They must 
either say that the logical requirements 
for measurement in physics, as laid down 
by the logicians and other experts in the 
field of measurement, do not hold for 
psychology, and then develop other principles
that are logically sound; or they
must admit that their attempts at measurement
do not meet the criteria and
both cease calling these manipulations
by the word “measurement” and stop
treating the results obtained as if they
were the products of true measurement.


For example Guild, who seems to have 
taken the most extreme position against 
the possibility of measurement in psychology,
says, “To insist on calling these
other processes measurement adds noth- 
ing to their actual significance but merely 
debases the coinage of verbal intercourse. 
Measurement is not a term with some 
mysterious inherent meaning, part of 
which may be overlooked by the physi- 
cists and may be in course of discovery 
by psychologists. It is merely a word conventionally
employed to denote certain
ideas. To use it to denote other ideas 
does not broaden its meaning but des- 
troys it: we cease to know what is to be 
understood by the term when we en- 
counter it; our pockets have been picked 
of a useful coin” (19). 

The Final Report of the committee of 
the British Association for the Advance- 
ment of Science holds out hope for a 
third solution, when in paragraph 10 
it states, “Some members, perhaps all, 
admit that their opinion might change 
if new facts were established; but the 
facts that would be necessary for this 
purpose are not of the kind that can be 
established by any experimental method 
at present in general use” (14). 

For convenience the discussion will
be divided into sections. The logical
criteria which the physicists claim that
all measurement must meet will be presented
and discussed in Section B; the
practical operations necessary for the fulfillment
of these criteria will be discussed
in Section C; the position of Stevens,
who has given the most vigorous reply
to the logicians’ criticisms, will be discussed
in Section D; the specific criticisms
of the psychological operations for
measurement which have been raised by
the physicists will be presented and discussed
in Section E; the criticisms of
psychologists of their own methods will
be presented and discussed in Section F;
the special problem of zero subjective
magnitudes will be discussed in Section
G; it will be shown in Section H that
measurement in psychology does not depend
on the prior measurement of any
other magnitude; Section J contains a
brief summary of the discussion up to
that point.

¹ I.e., the method of equal appearing intervals
and the method of just noticeable differences.


SECTION B. THE LOGICAL REQUIREMENTS
OF MEASUREMENT²


A distinction must be made at the
outset between measurement defined as
the construction of a scale and measurement
defined as the use of the scale after
it has been constructed. Utter confusion
will result from the confounding of these
two definitions. The use of a measuring
scale after it has been constructed is a
more or less simple matter involving the
comparison of the object to be measured
with the standard scale. The word measurement
is never used in this sense in this
paper. As here used it refers to the more
fundamental problem of scale construction.

Measurement, according to Campbell,
is the assignment of numerals to systems³
according to scientific laws. The scientific
laws spring from the relations demonstrated
between the systems with respect
to a certain magnitude.

² In this section the author has borrowed liberally
from Campbell (5, 6, 7), Guild (19) and
Cohen and Nagel (9). No references are given
except in those cases where an author is quoted
directly or an illustrative example employed by
the author is used in the text.

³ Campbell apparently uses the term “system”
to mean any objects with which the physicist
deals. The term would seem to include anything
from pieces of wood or rock to electric lamps
and voltage dividers.

The first requirement for measurement
is that it must be possible to arrange
the systems to be measured in an order
with respect to a given magnitude. The result of
this operation is known as an ordinal 
scale. To do this it must first be demon- 
strated by some operations that the rela- 
tion between the systems is transitive 
and asymmetrical. 

If the symbol $\rightarrow$ means “bears a certain
relation to” and $\nrightarrow$ means “does not
bear that relation to,” it must then be
shown experimentally that the relation
in question is asymmetrical, that is,

if $A \rightarrow B$ then $B \nrightarrow A$    I (I)

and transitive, that is,

if $A \rightarrow B$ and $B \rightarrow C$, then $A \rightarrow C$.    II (I)

If the above symbols are replaced by
$>$ (greater than) and $\ngtr$ (not greater
than) or by the converse $<$ (less than)
and $\nless$ (not less than), it would be necessary
to show that,

if $A > B$ then $B \ngtr A$    I (II)

and that

if $A > B$ and $B > C$, then $A > C$.    II (II)


It will be seen that the relation = does 
not exist in such a series. An example 
of such a series without the relation =, 
given by Campbell, is the direct line of 
male descent. The generating relations 
are “ancestor of” and “descendant of.” 
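
The two ordinal conditions can be checked mechanically once the pairwise outcomes have been recorded. The following is a minimal sketch in Python, assuming that the experimental results are stored as a set of ordered pairs (a, b), each meaning "a bears the relation to b." The function names and the sample data are hypothetical and are not taken from Campbell.

```python
def is_asymmetrical(relation):
    """Condition I: if (a, b) holds, then (b, a) must not hold."""
    return all((b, a) not in relation for (a, b) in relation)


def is_transitive(relation):
    """Condition II: if (a, b) and (b, c) hold, then (a, c) must hold."""
    return all(
        (a, d) in relation
        for (a, b) in relation
        for (c, d) in relation
        if b == c
    )


# Hypothetical direct line of male descent, generated by "ancestor of".
ancestor_of = {
    ("grandfather", "father"),
    ("father", "son"),
    ("grandfather", "son"),
}

# A hypothetical inconsistent set of judgments, given for contrast.
inconsistent = {("A", "B"), ("B", "C"), ("C", "A")}

print(is_asymmetrical(ancestor_of), is_transitive(ancestor_of))
print(is_asymmetrical(inconsistent), is_transitive(inconsistent))
```

The first set satisfies both conditions and so yields an ordinal series; the second is asymmetrical but not transitive, so that no order of the three systems could represent it.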

The relation =, as between A and B,
is associated with the following propositions:

$A = B$ if, and only if,

1) $A \ngtr B$ and $A \nless B$    III
2) if $A > C$ then $B > C$    IV
3) if $A < C$ then $B < C$.    V


The relation = defined in this manner
is always transitive, that is,

if $A = B$ and $B = C$ then $A = C$    VI

and is always symmetrical, that is,

if $A = B$ then $B = A$.    VII
On casual inspection it might seem
difficult for a given relation to satisfy III
and yet fail to satisfy IV and V, but if
the example of Campbell’s, given above,
is examined it will be seen that it would 
be possible for A to be neither the an- 
cestor nor the descendant of B and thus 
satisfy III, and yet A cannot then satisfy 
IV or V. To quote Campbell “. . . no 
two males can have all the same ancestors 
and descendants” (6). 
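
The distinction between satisfying III alone and satisfying III, IV and V together may be illustrated by a short sketch. It assumes that the established relations are held as a set of ordered pairs (x, y) meaning x > y. The function `equal` and both sets of data are hypothetical, and IV and V are here tested in both directions so that the resulting relation is symmetrical.

```python
def equal(a, b, greater, systems):
    """Return True if a = b in the sense of propositions III, IV and V.

    `greater` is a set of ordered pairs (x, y) meaning x > y;
    x < y is read as (y, x) being a member of `greater`.
    """
    # III: a is neither greater than nor less than b.
    if (a, b) in greater or (b, a) in greater:
        return False
    for c in systems:
        if c in (a, b):
            continue
        # IV: a and b are greater than exactly the same systems.
        if ((a, c) in greater) != ((b, c) in greater):
            return False
        # V: a and b are less than exactly the same systems.
        if ((c, a) in greater) != ((c, b) in greater):
            return False
    return True


# Hypothetical weights: x1 and x2 balance and behave alike toward y and z.
weights = {"x1": 5, "x2": 5, "y": 3, "z": 8}
heavier = {(p, q) for p in weights for q in weights if weights[p] > weights[q]}
print(equal("x1", "x2", heavier, weights))

# Hypothetical family: F has sons S1 and S2, and S1 has a son G.
people = ["F", "S1", "S2", "G"]
ancestor_of = {("F", "S1"), ("F", "S2"), ("S1", "G"), ("F", "G")}
print(equal("S1", "S2", ancestor_of, people))
```

The two brothers satisfy III, since neither is the ancestor of the other, but they fail IV because only one of them is the ancestor of G. The two weights satisfy all three propositions.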

According to Campbell a magnitude 
must have the relation =, for, as will 
be seen later, this relation is necessary 
in the construction of an additive scale. 

He sums up the first conditions for
measurement as follows: “The first condition
of measurement, namely that a
magnitude must be capable of order, can
now be stated formally as follows. The
systems measured must, in virtue of the
property concerned, be a field of a pair
of converse T.A.⁴ relations and the T.S.⁵
relation associated with them; every system
must be > or < or = every other,
and must be = at least one other. The
first law of measurement is the statement
that this condition is fulfilled” (6).

The rule for assigning numerals to
represent a series in which the above
relations have been established is: if
A > B then the numeral assigned to A
must be greater than the numeral assigned
to B;⁶ conversely, if B < A, then
the numeral assigned to B must be less
than the numeral assigned to A. If
A = B, then the numeral assigned to A
must be the same as the numeral assigned
to B.

According to Campbell, the existence
of the relation = is one of the things
that distinguishes the order characteristic
of magnitudes from that which is
characteristic of numerals. Numerals, by
which is meant simply a group of conventional
signs or marks on a piece of
paper, obtain their order by convention.
The order is not determined by facts
such as the order existing in the family
tree. If only one of the many numeral
series is used, every member is either
greater or less than every other member.
There is no relation =. (Naturally if
several series were combined, such as the
decimal and fraction series, then it would
be possible to find two that were equal
to each other, as 1.5 = 1½.)

⁴ Transitive asymmetrical.

⁵ Transitive symmetrical.

⁶ The problem of how a numeral, which has
been defined as a symbol, can be greater than
another numeral is taken up later.

However the most important differ- 
ence between numerals and magnitudes 
is that the order of the numerals is con- 
ventional while the order of the systems 
in respect of the magnitude is deter- 
mined by experimental operations. 

Numerals have by convention a transi- 
tive, asymmetrical relation. Now if they 
are going to be used to represent the 
order of the systems in respect of a cer- 
tain magnitude, it must be shown ex- 
perimentally that the relation between 
the systems which they represent is also 
transitive and asymmetrical. If it is im- 
possible to show this then the numerals 
that have been assigned are meaningless 
in as much as the conventional relations 
between them do not express the rela- 
tions between the systems. 

There is nothing in the experimentally 
established relation A >B that tells 
what numeral is assigned to A. The rule 
simply states that it must be greater than 
that assigned to B. As yet there are no 
operations to determine by how much 
A > B, so the assigned numerals cannot 
reflect a relation that has not been es- 
tablished. In other words, if 2 is assigned 
to B and 4 to A, it is impossible to say 
that A is twice as great as B because it 
has not yet been shown experimentally 
that A is twice B. That is, the
numerals can express only those relations 
that have been shown experimentally to 
exist between the systems to which they 
are assigned. 
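
The point that an ordinal assignment carries no information about ratios can be shown with a brief sketch. It assumes an experimentally established order A > B > C, recorded as ordered pairs. The two numeral assignments and the function name are hypothetical.

```python
def respects_order(assignment, greater):
    """True if every established relation x > y is mirrored by the numerals."""
    return all(assignment[x] > assignment[y] for (x, y) in greater)


# Hypothetical established order: A > B > C.
greater = {("A", "B"), ("B", "C"), ("A", "C")}

# Two assignments that both obey the ordinal rule.
first = {"A": 4, "B": 2, "C": 1}
second = {"A": 100, "B": 3, "C": 2.5}

print(respects_order(first, greater), respects_order(second, greater))

# The ratio of the numerals for A and B differs between the assignments,
# so it reflects nothing that has been established by experiment.
print(first["A"] / first["B"], second["A"] / second["B"])
```

Both assignments are equally legitimate representations of the order, which is why the statement "A is twice B" cannot be read from either of them.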

An interesting example of an ordinal 
scale is the Mohs scale of hardness. Mohs, 
in developing his scale for the hardness 
of minerals, tried to ordinalize minerals 
according to the relation “scratches.” 
The operation by which the relation of 
the minerals was to be determined was 
to attempt to scratch one mineral with 
another. He selected ten minerals to rep- 
resent particular points on the scale and 
assigned numerals to them. The numer- 
als ranged from 1 to 10, where the num- 
eral 1 was assigned to that mineral which 
could be scratched by every other mineral 
and which could scratch no mineral; and 
10 was assigned to that mineral which 
could scratch every other one and be 
scratched by none. It should be noted 
that there is nothing in the operations 
adopted by Mohs that tells how many 
more times as hard one mineral is than 
another. It simply tells that one mineral 
is harder than another, as defined by the 
operation of scratching, and therefore 
should be assigned a higher numeral. 

Later attempts at measuring hardness, 
defined by other operations, such as 
microscopic measurement of the depth of 
a scratch made by a diamond under 
constant pressure or the amount of work 
done in grinding away a certain weight 
or volume of material, have shown that 
the interval between Mohs’ hardness of
10 and 9 was greater than the interval
between 9 and 1 (3%).

Mohs assumed that his relation of
“scratches” was transitive and asymmet- 
rical and that the = associated with it 
was transitive and symmetrical. This has 
been shown to be false. Some minerals 
have been found that satisfy III but not
IV or V,⁷ that is, they cannot scratch each
other, and yet have different powers of 
scratching a third mineral. Because of 
this, according to Campbell, hardness as 
defined by the operation of scratching is 
not a magnitude at all. 

In order to determine what numeral
should be assigned to A if A > B or,
in other words, in order to be able to
construct an additive or extensive scale,
the property being scaled must be capable
of being “added.” The second requirement
for measurement, then, is
that it must be possible to find some
operation by which the magnitudes of
two = systems may be combined to form
systems that are $\neq$. In order to be additive
the proposed method of combination
must meet the following conditions:

if $A = A'$ and $B > 0$, then $A + B > A'$    VIII
if $A + B = X$ then $B + A = X$    IX
if $A = A'$ and $B = B'$ then $A + B = A' + B'$    X

and

$(A + B) + C = A' + (B' + C')$.    XI

To quote Campbell again, “The state- 
ment that these conditions are fulfilled 
by any proposed method of addition de- 
fined by + and ( ), applied to systems 
possessing any magnitude defined by >, 
<, and =, is the second law of measure- 
ment of that magnitude” (6). 
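
Whether a proposed operation of combination satisfies conditions VIII to XI can be tested exhaustively over a small set of systems. The sketch below assumes hypothetical rods, each represented as a tuple of segment lengths, with `sum` standing in for the experimental comparison of magnitudes. The two operations `end_to_end` and `keep_longer` are illustrative inventions, the second being a deliberately non-additive combination.

```python
from itertools import product


def satisfies_second_law(systems, add, value):
    """Check conditions VIII to XI for every combination of `systems`.

    `add(x, y)` is the proposed operation of combination and `value(x)`
    stands in for the experimental determination of >, < and =.
    """
    def eq(x, y):
        return value(x) == value(y)

    for a, a2, b, b2, c, c2 in product(systems, repeat=6):
        # Only cases with A = A', B = B' and C = C' are relevant.
        if not (eq(a, a2) and eq(b, b2) and eq(c, c2)):
            continue
        # VIII: if A = A' and B > 0, then A + B > A'.
        if value(b) > 0 and not value(add(a, b)) > value(a2):
            return False
        # IX: if A + B = X, then B + A = X.
        if not eq(add(a, b), add(b, a)):
            return False
        # X: if A = A' and B = B', then A + B = A' + B'.
        if not eq(add(a, b), add(a2, b2)):
            return False
        # XI: (A + B) + C = A' + (B' + C').
        if not eq(add(add(a, b), c), add(a2, add(b2, c2))):
            return False
    return True


# Hypothetical rods; (2,) and (1, 1) are distinct systems of equal length.
rods = [(2,), (1, 1), (3,), (5,)]


def end_to_end(x, y):
    """Lay two rods end to end."""
    return x + y


def keep_longer(x, y):
    """A non-additive 'combination': the result is merely the longer rod."""
    return x if sum(x) >= sum(y) else y


print(satisfies_second_law(rods, end_to_end, sum))
print(satisfies_second_law(rods, keep_longer, sum))
```

Laying rods end to end passes all four conditions. The second operation fails VIII at once, since combining two equal systems yields a system no greater than either.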

It is now necessary to have a rule for 
the assignment of numerals to the sys- 
tems to represent the new relations that 
have been obtained by the operation of 
addition. 

In discussing the assignment of numerals
it is well to stress again that it is
numerals that are assigned and not numbers.
As Campbell says, “. . . it would be
difficult to avoid the impression that the
conception of number and the rules of
arithmetic were concerned in the matter.
Actually they are not concerned. Of
course, they are closely connected with
measurement; but if we fail to recognize
that they are not essential we shall not
understand the connection” (6).

⁷ Throughout this monograph roman numerals
will be used to refer to the logical requirements
presented and discussed in this section.

The first operation for the construc- 
tion of an additive or extensive scale is to 
select some system that belongs to the 
series of the magnitude. The selection is 
entirely arbitrary. Then another system 
is found that is = to this first system. If 
the numeral A is assigned to the system 
first selected, then the numeral A’ is as- 
signed to that system that is = to A. 
Since the systems must be identifiable, 
that is since it is absolutely necessary 
that they can be told apart by some 
method, it will be convenient to call the 
second system A’ to indicate that it is 
of the same magnitude as the system A 
but is a different system. The ’ is not
to be taken as an indication of a different 
numeral but of the same numeral as- 
signed to a different system. 

The next step is to “add” the systems 
A and A’ and seek a system that is = 
to their combination. To this system the 
numeral B can be assigned. Another sys- 
tem that is = to B is then found and 
assigned the numeral B’. B and B’ are
“added” and another system C is found 
such that B + B’ = C, and so on. 

Using the ordinary numeral series in- 
stead of the alphabet and arbitrarily as- 
signing 1 to the systems previously as- 
signed A, B would equal 1+ 1 or 2. 
Likewise C = 2 + 2 or 4. Naturally numerals
may be assigned to intermediate
systems. If a system A” is found that
is = to A, it is possible
to find a system = A + A’ + A” to
which the numeral 3 would be assigned.

It is easily seen that a great advance 
has been made when it is possible to 
construct an additive scale. The ordinal 
scale did not tell by how much A > B
because there was nothing in the rela- 
tions established by experiment that de- 
termined by how much A > B. But now, 
once having chosen a standard, all the 
other magnitudes are uniquely deter- 
mined. It is known for example that C 
is not only > B but also that C= B + 
B’, because the operations performed on 
the systems have determined this relation 
experimentally. 
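
The construction just described may be sketched as a procedure. The example assumes a hypothetical collection of objects whose magnitudes are hidden from the experimenter and are consulted only by a simulated balance. The labels, the hidden values and the function `build_scale` are all illustrative.

```python
def build_scale(objects, standard, steps):
    """Assign numerals by repeated doubling, as in the A, B, C construction.

    `objects` maps a label to a hidden magnitude. The hidden values are
    used only to simulate the balance and never appear in the scale.
    `next` raises StopIteration if no suitable system can be found.
    """
    def balance(left, right):
        """Simulated comparison: True if the two pans are level."""
        return sum(objects[x] for x in left) == sum(objects[x] for x in right)

    scale = {standard: 1}  # the choice of standard is arbitrary
    current, numeral = standard, 1
    for _ in range(steps):
        # Find a different system equal to the current one (A' for A).
        twin = next(
            o for o in objects if o != current and balance([o], [current])
        )
        scale[twin] = numeral
        # Find a system equal to the two combined (B = A + A').
        combined = next(
            o for o in objects
            if o not in (current, twin) and balance([o], [current, twin])
        )
        numeral = numeral + numeral
        scale[combined] = numeral
        current = combined
    return scale


# Hypothetical objects with hidden magnitudes.
objects = {"a": 10, "a2": 10, "b": 20, "b2": 20, "c": 40, "odd": 30}
print(build_scale(objects, "a", 2))
```

The numerals obtained depend only on the chosen standard and on the outcomes of the comparisons. Had a different standard been selected, every numeral would change in the same proportion.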

It is now time to examine more closely 
the rules for the assignment of numerals. 
Before this is done it will be necessary 
to clear up some questions of termi- 
nology. Much of the difficulty of the sub- 
ject of measurement seems to stem not 
only from the confusion of the meanings 
of the words number and numeral, but 
also from the fact that the word num- 
ber itself has several meanings. The fol- 
lowing definitions have been adopted in 
this paper. 

Numeral. A numeral is a sign or sym- 
bol that may be conventionally used to 
represent a number. In other words it is 
simply a black mark on a piece of paper. 
There are several conventional numeral 
series, 1, 2, 3, etc., or A, B, C, etc. 

Number. Russell’s definition of num- 
ber is not used in this paper. Number is 
here regarded as a discriminable charac- 
teristic of systems that may be measured 
as any other discriminable characteristic. 
The term is used in much the same way 
as Stevens (40) has used the term “numerosity.”
“Numerosity,” he says, “is a property
defined by certain operations performed
upon groups of objects.” In his
discussion he begins by saying that it is 
possible to establish a rank order of 
groups of objects [beans for example] in 
respect of numerousness⁸ simply by looking
at the piles of beans and judging
which of the piles is largest, etc.

⁸ This kind of numerousness is called subjective
number in this paper.

“We know from experience, however, 
that greater reliability can be had if we 
rank-order the groups by pairing succes- 
sively one bean from each group until 
one group is exhausted. Then if any 
beans remain in the other group, that 
group is said to have the greater numer- 
osity. ... If the pairing exhausts both 
groups simultaneously, their numerosity 
is equal ...,” etc. It can be seen that num- 
ber defined in this way can be measured 
fundamentally (see also Campbell, 6). 
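
The pairing operation quoted above can be written out as a short sketch. It is an illustration only: the name pair_off is hypothetical, and the groups of beans are modelled as simple lists.

```python
# Illustrative sketch: 'pair_off' is a hypothetical name for the pairing
# operation quoted above; the groups are modelled as Python lists.

def pair_off(group_a, group_b):
    """Remove one object from each group until one group is exhausted.

    Returns '>' if objects remain in group_a, '<' if objects remain in
    group_b, and '=' if both groups are exhausted simultaneously.
    No counting is used, only successive pairing.
    """
    a, b = list(group_a), list(group_b)
    while a and b:
        a.pop()
        b.pop()
    if a:
        return ">"
    if b:
        return "<"
    return "="

print(pair_off(["bean"] * 5, ["bean"] * 3))
```

It may be noted that the function never asks "how many"; the order of the groups is established by the operation alone.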

As this is not the usual definition of 
number it has been decided to call this 
kind of number “objective number” to 
distinguish it from number as a logical 
concept as used by Russell and from sub- 
jective number (numerousness). 

It will be remembered that systems ar- 
ranged in an experimentally established 
order had numerals assigned to them by 
the following rule: If A > B, then the
numeral assigned to A must be greater
than the numeral assigned to B and
conversely if B < A then the numeral
assigned to B must be less than the
numeral assigned to A. Furthermore if
A = B, the numeral assigned to A must
be equal to the numeral assigned to B.

It is now possible to ask the question:
how can a numeral, which has been
defined as a mere sign or symbol, have a
magnitude? In short, how can one numeral
be > or < or = any other numeral?
One possible answer has already been 
implied, when it was assumed that the 
numerals used were those that are con- 
ventionally arranged in an order. In 
other words >, with respect to numerals, 
means “following” in the numeral series. 
Thus 2 > 1 because it follows 1 and
D > B because it follows B in the num- 
eral series. 

However it is extremely important to
note that it is not necessary to use a
conventional numeral series in the
construction of the scale. Any other group of
numerals would do as well. In the event 
that a group of numerals which did not 
have a conventional order were used in 
the construction of the scale, the nu- 
merals would be arbitrarily assigned 
to the various systems in the experi- 
mentally established ordinal series. But 
once having been assigned, their order, 
so far as measurement of the particular 
magnitude is concerned, would be 
uniquely determined by the order of 
the systems to which they were assigned. 
In other words, though the numerals 
did not originally possess an order, once 
they have been assigned to a group of 
systems, they represent the relations that 
have been determined experimentally 
between the systems in the ordered 
series. If the relation between the 
systems is transitive and asymmetrical, 
then the numerals express this transitive
and asymmetrical relation; if the
relation between the systems is intransitive
and symmetrical, the numerals express
an intransitive symmetrical relation.
Whatever relations the numerals
express, they express only by virtue of 
the fact that these relations were shown 
to exist between the systems to which 
they have been arbitrarily assigned. 

It should be noted that this statement 
is also true if numerals with a conven- 
tional order are used. It so happens that 
by convention such a series as 1, 2, 3, 
etc., is transitive and asymmetrical and 
it so happens that the relation that must 
first be established between the systems 
is also transitive and asymmetrical. In 
other words both are an ordered series, 
one is ordered by convention and the 
other by experimental operations. For 
convenience, then, we assign numerals 
to the systems so that if A > B, the
numeral assigned to A is greater than
the numeral assigned to B, always re- 
membering that the relation “greater 
than” when applied to numerals is a 
matter of convention. 

There is nothing in the size of the 
numeral assigned to A that makes A 
greater than B. For example if A > B 
and the numeral 2 is assigned to B, then 
any numeral that is conventionally
greater than 2 may be assigned to A, 
say 4. But suppose that A > B and the 
numeral 2 is arbitrarily assigned to B 
and the numeral 1 is assigned to A. The
fact that 1 is assigned to A does not now 
make A less than B. In fact it works the 
other way; the fact that A has been 
shown to be greater than B means that 
the numeral 1 assigned to A must be 
interpreted as greater than the numeral 
2 assigned to B. A new series is brought 
into being by these operations in which 
the numeral 1 is “greater than” the 
numeral 2. This means that, so far
as this particular magnitude is concerned,
the numeral series 1, 2, 3, etc.,
cannot be used or interpreted in the
usual way, i.e., 2 > 1, etc.
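
The point may be illustrated by a small sketch in which the order of the numerals is read off from the systems and from nothing else. The systems, the arbitrary assignment and the name numeral_greater are all assumptions made for the illustration.

```python
# Sketch under stated assumptions: the systems, their experimentally
# established order and the function name are invented for illustration.

# Experimentally established order, greatest first: A > B > C.
systems_in_order = ["A", "B", "C"]

# Numerals assigned arbitrarily, against the usual convention.
assignment = {"A": "1", "B": "2", "C": "7"}

def numeral_greater(n1, n2):
    """'Greater than' for numerals, read off from the systems alone."""
    by_numeral = {numeral: system for system, numeral in assignment.items()}
    return (systems_in_order.index(by_numeral[n1])
            < systems_in_order.index(by_numeral[n2]))

print(numeral_greater("1", "2"))  # True: here the numeral 1 is "greater than" 2
```

The numerals are held as strings to emphasize that they are treated as marks; their conventional arithmetic order plays no part in the result.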

The seeming absurdity of 1 > 2 arises 
because one is accustomed to think of 
1 and 2 as numbers, not numerals. The 
number conventionally represented by 
the numeral 1 is certainly not greater 
than the number conventionally repre- 
sented by the numeral 2, but the numeral 
1 may be regarded as either greater or 
less than the numeral 2 depending on 
the magnitudes of the systems to which 
these numerals are assigned. This would 
be highly inconvenient and it is much 
more reasonable to use the conventionally
ordered series. But it is also important
to see that it is not necessary to
use the conventional series, in order to
make clear the fact that the numerals
add nothing to the experimentally
determined relations.

The same reasoning that applies to
the use of nonconventional numerals to
represent systems in an ordered series
applies to their use for representing the
systems in an additive series. For example
it was said that if B = A + A’
and the numeral 1 is assigned to A, then
the numeral 1 + 1 or 2 would be assigned
to B. But what meaning can be
assigned to the statement, “the numeral
1 + the numeral 1 = the numeral 2”?
There is nothing about the numerals 1
and 2 qua numerals that would justify
the conclusion that 1 + 1 = 2. However
as conventional numerals, 1 + 1 = 2 because
the numeral 1 has been conventionally
assigned to a certain objective number
and the numeral 2 has been assigned
to another objective number. Further- 
more if we call the number to which the 
numeral 1 is assigned X and the number 
to which 2 has been assigned Y, and it is 
possible to show by a series of operations 
involving addition that Y = X + X’,
then it is possible to state that 1 + 1 = 2.
But it must be carefully borne in mind
that so far as measurement is concerned
this numerical statement has no meaning
apart from the experimentally established
relations between the objective
numbers X and Y. When the criteria for 
measurement have been met the relations 
between the systems are analogous to the 
relations between objective numbers 
which are represented by the ordinary 
numeral series 1, 2, 3, etc. It is then 
possible to use the ordinary numeral 
series in its conventional sense and 
apply the powerful tool of arithmetic to 
the symbols with the knowledge that 
these arithmetic manipulations repre- 
sent, with only a small margin of error, 
the actual physical operations that 
might be performed on the systems 
themselves. 
But it is not necessary to use the conventional
numeral series; it is only convenient
to do so. Any other numeral
series, or any other group of numerals 
could be used, though it would be neces- 
sary to construct a new arithmetic, i.e., 
new laws for the manipulation of 
numerals, if it was necessary to manipu- 
late these numerals rather than perform 
actual operations on the systems. 

The procedure outlined above for the 
construction of an additive scale results 
in what Campbell names an A-magni- 
tude, also sometimes called a funda- 
mental magnitude. An A-magnitude (or 
fundamental magnitude) is one for 
which a practical operation of addition 
may be found. There is another larger 
and very important group of magnitudes 
which are called by Campbell, B-magni- 
tudes, also sometimes called derived 
magnitudes. A B-magnitude (or derived 
magnitude) is one that is measured in 
terms of an A-magnitude. It cannot be 
measured directly because it is impossible 
to find a practical operation for addition 
that will meet VIII, IX, X, XI. B-magni- 
tudes may be of two kinds. To quote 
Campbell (6), “The property measured in
this manner may be nothing but that of 
being subject to the numerical law, and 
may be indefinable apart from that law. 
But it has often happened that the dis- 
covery of a numerical law, and of the 
constants associated with it, has enabled 
us to measure in this way a property 
that had previously been suspected of 
being a magnitude, but had not been 
actually measured.” 

The example usually given to illus- 
trate B-magnitudes is density. Density 
= mass/volume. Both mass and volume 
are A-magnitudes. As Campbell shows, 
it is suspected that density is a magnitude 
because liquids might be arranged in an 
order of magnitude by defining “denser
than” by “floats on.” It could be shown
that this relation is transitive by show- 
ing that if A floats on B and B floats on 
C, then A will float on C. It could also 
be shown that the relation is asymmetri- 
cal. If A floats on B, then B does not 
float on A. Equality could be defined as 
that state in which neither liquid will 
float on the other permanently. 

If liquids are then arranged in an 
order defined by the quotient mass/ 
volume and this order is identical with 
the order obtained by defining density 
by flotation, it is possible to say that the 
property measured by the quotient mass/ 
volume is the same as the property 
measured by flotation. To quote Campbell,
“. . . the discovery of the law⁹ has
enabled us to measure a property previously
immeasurable” (6).

It may be asked why the quotient 
mass/volume is thought to measure the 
same magnitude that is measured by 
flotation. Campbell lays down the gen- 
eral principle that “the conception of a 
magnitude is inseparable from that of 
the order characteristic of it. It is natu- 
ral, therefore, to regard as the same 
magnitudes, or as magnitudes of the 
same kind, properties that invariably 
have the same order” (6).
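
The comparison of the two orders can be sketched as follows. The sketch is an illustration under assumptions: the liquids and their masses and volumes are rough invented values, the set floats_on stands in for the record of actual flotation experiments, and x is counted "denser than" y when y floats on x.

```python
# Illustrative sketch: the liquids and their masses and volumes are
# rough assumed values, and 'floats_on' stands in for the experiment.

liquids = {
    "oil": {"mass": 9.2, "volume": 10.0},
    "water": {"mass": 10.0, "volume": 10.0},
    "mercury": {"mass": 135.0, "volume": 10.0},
}

# Assumed experimental record: (x, y) means x was observed to float on y.
floats_on = {("oil", "water"), ("water", "mercury"), ("oil", "mercury")}

def denser_by_flotation(x, y):
    """x is 'denser than' y if y floats on x."""
    return (y, x) in floats_on

def denser_by_quotient(x, y):
    """x is 'denser than' y if mass/volume is greater for x."""
    qx = liquids[x]["mass"] / liquids[x]["volume"]
    qy = liquids[y]["mass"] / liquids[y]["volume"]
    return qx > qy

names = list(liquids)
# The two definitions agree if they give the same verdict for every pair.
same_order = all(
    denser_by_flotation(x, y) == denser_by_quotient(x, y)
    for x in names for y in names if x != y
)
print(same_order)
```

If the two orders agreed for every pair of liquids actually tried, the quotient would be taken, on Campbell's principle, to measure the same magnitude as flotation.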

Temperature is often mentioned as an 
example of a magnitude which is 
measured without a practical operation 
for addition. Guild (13) defines temperature
as “the condition of an object in
virtue of which it may feel hot or cold
to the touch. . . .” He states further
that “Experiment has shown many observable
relations of a general kind between
the temperature of bodies and
their measurable properties. The length
and electrical resistance of a given rod,
for example, are usually greater when
the rod feels hot than when it feels
cold.”

⁹ The “law” refers to the fact that mass/volume
is a constant for given liquids under defined
conditions.

In order to measure temperature as a 
B-magnitude it is only necessary to 
choose some measurable property which 
varies continuously with it. This having 
been done it is possible to postulate a 
law relating the property chosen to tem- 
perature. For example it is possible to 
postulate the law that equal increments 
in the chosen A-magnitude represent 
equal increments of temperature. 
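
A postulated law of this kind may be sketched in a few lines. The sketch is only an illustration: the two reference readings and the numerals attached to them are assumptions chosen for the example, since the text specifies no particular scale, and make_scale is a hypothetical name.

```python
# Sketch only: the two reference readings and their numerals are
# assumptions chosen for illustration; the text specifies no scale.

def make_scale(reading_low, numeral_low, reading_high, numeral_high):
    """Return a function assigning a temperature numeral to a reading.

    The postulated law is that equal increments of the chosen
    A-magnitude represent equal increments of temperature.
    """
    per_unit = (numeral_high - numeral_low) / (reading_high - reading_low)

    def temperature(reading):
        return numeral_low + per_unit * (reading - reading_low)

    return temperature

temperature = make_scale(reading_low=100.0, numeral_low=0.0,
                         reading_high=102.0, numeral_high=100.0)
print(temperature(101.0))
```

The function does not test the law; it merely embodies it, which is the sense in which the law defines the scale rather than describes it.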

The mercury thermometer is an 
example of such a postulated law. Equal 
increments of the volume of mercury are 
deemed to represent equal increments of 
temperature. It is obvious that the relation
of temperature to volume of mercury
must be constant. If this were not
true the scale would be useless. But how 
is it possible to tell whether the relation 
is constant? As Guild points out, it is 
impossible to determine the constancy 
of this relation by finding out whether 
various other phenomena bear a con- 
stant relation to temperature as defined 
by the mercury scale. That reasoning
“. . . is based on an a priori assumption
of the constancy of natural laws” (13).
Guild’s answer to this question is, “The 
point is that the constancy of the law 
defining our scale does not require con- 
firmation. It is not an assumption which 
may or may not be true, it is a postulate 
forming part of the conventional framework
of physical measurement. The
postulated law is necessarily always true 
for the simple reason that it serves the 
purpose of defining temperature as ‘the 
thing for which this law is true.’ There 
is no criterion of the magnitude of a 
temperature (nor of any B-magnitude)
other than the law by which we choose 
to define it. It would therefore be meaningless
to ask whether the temperature
to which our scale assigns the numeral
n is in fact the same temperature at all
times and places” (13).

The physicists seem to wish to restrict 
the term measurement to those magni- 
tudes that may be measured funda- 
mentally, i.e., A-magnitudes. For exam- 
ple Guild says (13), “The fact that there 
is no operation of addition applicable to 
temperature qua temperature, prevents 
it from being measurable in the true 
sense of the term.” But it should be 
noted well that he also says (13), “When 
once we have defined some such scale
of temperature, temperature becomes 
‘measurable’ in the broad sense in which 
this word is generally used; and the laws 
relating other physical variables with 
temperature as so defined become open 
to empirical investigation.” 

The author of this study thinks that 
the words “open to empirical investiga- 
tion” might also have been put into 
italics. 

Before closing this section it might be 
well to give Campbell’s definition of 
zero magnitudes although any discussion 
will be postponed to Section C. 

The system B has the magnitude 0,
when A = A’, if
A + B = A’          (XII)


SECTION C. THE OPERATIONS NECESSARY 
FOR FULFILLING THE PHYSICISTS’ RE- 
QUIREMENTS FOR MEASUREMENT 


In the section above no stress was laid 
on the necessary operations for meeting 
the criteria for measurement that were 
discussed. The discussion of the criteria 
and the discussion of the operations 
have been separated only for convenience.
Actually the criteria and the operations
by which the criteria are satisfied
are inseparable. It would be possible to 
describe the criteria that must be met, it 
would also be possible to describe the 
operations that ought to be performed on
a group of systems in order to measure 
them, and still it might be impossible to 
measure the systems in respect of the 
given magnitude, because the operations 
could not be carried out in practice. 
The criteria are not theoretical, they are 
practical. The relations stated in them 
must be shown to hold empirically. 

In glancing back at the criteria it will 
be noted that the following symbols were 
used: >, <, ≯, ≮, =, ≠, ( )¹⁰ and +.

Of these symbols, ( ) and + may be
described as operations and the rest as
relations. That is, >, <, ≯, ≮, = and ≠
state that the relation “greater than,”
“less than” or “equals” has been found 
to exist or not to exist between any two 
systems. While it is true that these sym- 
bols do not represent operations, they 
imply that certain operations have been 
performed on the systems so that this 
relation could be determined. 

For example, in order to determine 
whether the relation > existed between 
two systems with respect to any magni- 
tude it would be necessary to: 

1) State the operations by which > 
is to be defined. 

2) Actually perform these operations. 

3) Judge whether the operational 
criterion has been met. 

If the magnitude were weight these 
three steps might be applied as follows: 

1) Heavier than is defined by placing 
two objects on a balance, one on each 
pan. If one of the pans sinks, the weight 
in that pan will be deemed to be the 
greater; or, in other words, the system 
in that pan will be greater than the 
system in the other pan in respect of the 
property weight. 

2) It is now necessary to find a balance,
obtain a group of systems, place pairs of
them on the balance; or, in other words,
actually carry out the operations used to
define the magnitude.

3) There must, of course, be some way
of determining whether the pan sinks
or does not sink. There may be several
ways of determining this fact but all of
them will ultimately rest upon a judgment
made by the experimenter. The
usual judgments are those of “difference”
(which can be either “greater than” or
“less than”) or “no difference.” It should
be noted again that the judgment of “no
difference” is not the same as the judgment
of equality. The judgment of no
difference is implied in III. But in order
to establish equality, it must be demonstrated
that IV and V also hold.

¹⁰ The parentheses refer to a single system = to
the sum of the systems included within them.

The question may now be asked, why 
does one choose one operation rather 
than another? How, for example, does
one know that the weights should be put 
in opposite pans of the balance? Why 
not place one weight in the pan of the 
balance and the other on the floor? 
Could not this operation define the rela- 
tion “greater than” for the magnitude 
weight? 

The answer is, simply, “Try it.” Sup- 
pose “greater than” is defined in this 
fashion. It would soon be apparent that 
the relation established between the 
systems by these operations is not asym- 
metrical, though it is transitive. In other 
words, if the operations are actually 
tried, it will soon become evident that it 
is impossible to obtain both an asym- 
metrical and a transitive relation. In 
short, weight as defined by this operation 
is not a magnitude. The “correct” operations
are those by means of which the
necessary relations may be experimentally
demonstrated. The “correct” operations
may be found if the experimenter
is ingenious and patient. Furthermore
some magnitudes now thought to be
fundamentally measurable may turn out
not measurable; others now not measurable
will turn out to be measurable,
when some ingenious experimenter discovers
the “correct” operations.
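
The advice to "try it" can be imitated in a short sketch that applies both proposed defining operations to the same systems and tests the resulting relation. Everything in it is an assumption made for illustration: the hidden weights, the two functions modelling the operations, and the names is_asymmetrical and is_transitive.

```python
# Illustrative sketch: the weights and both defining operations are
# modelled in code; all names here are assumptions, not the author's.
from itertools import permutations

weights = {"A": 3, "B": 2, "C": 1}

def sinks_on_balance(x, y):
    """x in one pan, y in the other: x's pan sinks if x is heavier."""
    return weights[x] > weights[y]

def sinks_with_other_on_floor(x, y):
    """x in the pan, y on the floor: x's pan sinks whatever y is."""
    return weights[x] > 0

def is_asymmetrical(greater, systems):
    """No pair may be 'greater than' in both directions."""
    return all(not (greater(x, y) and greater(y, x))
               for x, y in permutations(systems, 2))

def is_transitive(greater, systems):
    """If x > y and y > z, then x > z must also be observed."""
    return all(greater(x, z)
               for x, y, z in permutations(systems, 3)
               if greater(x, y) and greater(y, z))

for operation in (sinks_on_balance, sinks_with_other_on_floor):
    print(operation.__name__,
          is_asymmetrical(operation, weights),
          is_transitive(operation, weights))
```

Under these assumptions the balance yields a relation that is both asymmetrical and transitive, while the operation with one weight on the floor yields a relation that is transitive but not asymmetrical.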

>, <, etc., refer to relations that are 
based upon operations for their estab- 
lishment but + refers directly to an 
operation. ( ) also refers to an operation, 
or, perhaps better, to the result of sev- 
eral operations. 

( ) is used in the sense that (A + B) = 
that single system that is equal to the 
combined systems A and B. It is clear 
that () refers to the result of several 
operations one of which is addition. 

The operation for “addition”¹¹ is the
greatest single stumbling block to 
measurement in physics and psychology. 
In fact the physicists claim that measure- 
ment of sensation must almost always 
fail because the psychologist can hardly 
ever find a proper operation for +. 


Smith (36), Cohen and Nagel (9) and
Johnson (28) have stressed the fact that
this is not only true for sensation but
also for mental testing, the measurement
of attitudes, etc.

It will be well worth while to examine
the objections raised by the physicists, as
it will shed a good deal of light on the 
criteria for additivity. Comment on the 
application of these criteria to the 
measurement of sensation and to the 
field of mental testing will be withheld 
until a later section. 

The most important criterion that the 
psychologists fail to meet is that of 
“physical juxtaposition.” An example of
physical juxtaposition would be placing
the systems end to end in measuring
length; placing the weights on the same
pan of the balance in measuring weight;
connecting resistances in series in measuring
electrical resistance, etc. Campbell
(14, 7) would admit that the simultaneous
presentation of two auditory
stimuli to different ears would satisfy
this criterion of physical juxtaposition.
So too brilliance might be “added” by
allowing the light from two sources to
fall on the same surface. Having met this
first criterion of physical juxtaposition it
would then be necessary to show that
VIII, IX, X and XI held. As Johnson
(28) points out, they certainly do not
hold for brilliance in all cases.

¹¹ In the following discussion it must be remembered
that addition may refer to a logical
concept or to an actual set of physical operations.
We are interested in addition in this latter
sense. When the word is used in this sense it
will be set in quotes.

It seems that it was this criterion of
“physical juxtaposition” that was the
stumbling block to any agreement between
the physicists and psychologists on
the Committee of the British Association
for the Advancement of Science. The
Committee reported that agreement
seemed unattainable on the question of
whether it was possible to make quantitative
estimates of sensory events because,
to quote Bartlett (8), “If all measurement
must conform to the Laws of
Measurement enunciated by Dr. Campbell,
and, in particular, if the second
law can only be satisfied by the physical
juxtaposition of equal entities, then
sensation-intensity cannot be measured.”


Where has this new requirement for 
measurement come from? It will be re- 
membered that it is not discussed with 
the logical requirements for measure- 
ment. Although this requirement might 
be called an “operational requirement,” 
there certainly must be some logical
basis for its inclusion as one of the 
criteria that the operation of addition 
must meet. 

It is impossible to find a clear concise 
statement of what is meant by “addi- 
tion” in any of the publications of the 
physicists that have been mentioned. It
is impossible to find a single italicized 
sentence beginning “Addition is .. .” 
However it is possible to combine several 
statements made by Campbell and ob- 
tain a good idea of what he considers 
addition to be. But first it can be said 
that it is obvious that “addition” is no 
single operation that may be applied to 
any and all systems. The operation of
adding lengths will not apply to weights. 

Following is a selection of relevant
phrases from Campbell:


1) . . . the systems to be measured must
be capable of a certain kind of combination,
which will be termed addition . . . (6).

2) . . . A + B means the composite system
formed by combining the systems A and B
in a particular way; thus, if A and B are
rigid bodies, A + B may mean the body obtained
by connecting them rigidly (6).

3) The conditions that any proposed form
of combination must satisfy in order that it
shall be addition, and shall be suitable for
the fundamental measurement of any magnitude,
can then be expressed in a series of
propositions involving the symbols +, ( ) and
>, <, = characteristics of the magnitude (6).

4) The following are the chief of these
conditions.¹² They are similar to the arithmetical
“laws” of commutation and distribution
in addition . . . (Campbell here sets out
the equivalent of VIII, IX, X, XI [6].)

¹² I.e., the conditions mentioned in the sentence
above.

5) So much for the properties of Numbers
in virtue of which addition and subtraction
are applicable to them. What is the similarity
between these properties and the properties
of bodies in respect of weight which enable
us to apply to weight the process of addition?
The similarity is between the relation denoted
by the sign of addition and a relation
which can be established experimentally between
bodies in virtue of the fact that they
have weight; the propositions which are true
of one relation are true of the other. . . .
Then corresponding to the arithmetical proposition
that, if a = b and b = c, then a = c,
we shall state that, if a certain body A balances
another body B and if B balances
another body C, then A must balance C;
corresponding to the distributive law,
a + (b + c) = (a + b) + c, we shall state
that if P is a body which balances B and C
on the same pan and Q a body which balances
A and B on the same pan, then A and
P on the same pan must balance C and Q on
the same (plan, sic) pan; and so on for the
other laws.

Now these statements concern experi- 
mental facts; they assert that, in certain cir- 
cumstances, we shall observe something. The 
statements may be true or false; and, as with 
all statements of experimental fact, experi- 
ment only can determine whether they are 
true or false. If they are true there will be a 
certain similarity between the arithmetical 
process of addition and the arithmetical rela- 
tion of equality on the one hand and the 
physical process of addition and the physical 
relation of equality on the other; if they are 
false, there will not be this similarity (5). 

6) The only properties measurable directly 
by means of this rule are those (roughly 
termed quantities) which are additive—that 
is to say, which are such that, given two 
things A and B having the property, it is 
possible to produce by a precisely deter- 
mined operation (combination) a thing C 
which is greater in respect of the property 
than either A or B . . . (6).
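
The experimental statement in quotation 5, concerning the bodies P and Q, can be set out as a short sketch. It is an illustration under assumptions: the bodies are modelled by hidden weights chosen so that the premises hold, and pan and balances are hypothetical names standing in for the physical operations.

```python
# Sketch under assumptions: bodies are modelled by hidden weights and
# 'balances' stands in for the balance; the names are illustrative.

weights = {"A": 2, "B": 3, "C": 4, "P": 7, "Q": 5}

def pan(*bodies):
    """Bodies placed together on the same pan ('addition')."""
    return sum(weights[b] for b in bodies)

def balances(left, right):
    """Physical relation of equality: neither pan sinks."""
    return left == right

# Premises of the quoted law: P balances B and C; Q balances A and B.
assert balances(pan("P"), pan("B", "C"))
assert balances(pan("Q"), pan("A", "B"))

# Consequence to be tested by experiment: A and P balance C and Q.
law_holds = balances(pan("A", "P"), pan("C", "Q"))
print(law_holds)
```

In the sketch the law holds because ordinary arithmetic was used to model the pan; in the laboratory, as Campbell insists, only experiment can decide whether it holds.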


It is possible to gather quite clearly 
from the above quotations that Campbell 
believes that no one operation is neces- 
sarily “addition” but only those opera- 
tions that experimentally fulfill the 
criteria VIII, IX, X and XI. In other 
words, an operation is “additive” if it 
fulfills the criteria for additivity, just 
as a satisfactory operation for producing 
the relation “greater than” is one that 
satisfies I (II) and II (11). 

But still there is no answer to the 
question “why the criterion of physical
juxtaposition?” Since there is no mention
of it in Campbell’s treatment of the
logical requirements, the suspicion
arises that this criterion arose after the
physicists had successfully measured a
number of characteristics of their
systems. If the physicists found that the
operations that fulfilled the necessary 
requirements for additivity seemed to 
involve “physical juxtaposition,” it is 
possible to imagine that they induced 
that “physical juxtaposition” was a 
necessary requirement. It should be 
pointed out however that it is possible 
for this to be true for the systems with 
which the physicists deal without being 
a general law of measurement. 

Campbell says that “Addition is a 
process which is peculiarly characteristic 
of Numbers” (5). Objective number can
be scaled fundamentally (6). The result 
is the numeral series 1, 2, 3, etc., asso- 
ciated with the objective numbers 1, 2, 
3, etc. Numerals and numbers have be- 
come so inseparable in our thinking that 
statements like this are very confusing. 
Perhaps it might be better to say that 
numerals 1, 2, 3, have been assigned to 
the objective numbers in the following 
groups of objects ●, ● ●, ● ● ●:
thus, the number of this many objects,
●, has the numeral 1 assigned to it;
this many objects, ● ●, has the numeral
2 assigned to it; this many objects,
● ● ●, has the numeral 3 assigned to
it, etc. 

The defining operations for establishing
the relation “greater than” for objective
numbers are: 1) select two groups;
2) pair off object for object; 3) if one 
group is exhausted before the other, the 
group that has not been exhausted is 
termed “greater” with respect to the 
magnitude “objective number.” If both 
groups are exhausted simultaneously 
they are called “equal,” etc. It can 
readily be seen that these operations 
establish an ordinal scale which fulfills
all the necessary criteria.

THOMAS WHELAN REESE

But how are the groups added with
respect to the magnitude “objective number”
when an extensive scale is constructed?
Suppose we had a group with
this number of objects, ●, called A, and
have found a group with an equal number
of objects, ●, called A’, and we
wish to combine these two groups. How
do we “add” them, ● + ●? It is of
course possible to place them in physical
juxtaposition. So suppose that we place
them on the table so that they are touching
and they look like this, ●●. This
added group, A + A’, is now called B.
We can go ahead and find a group that
is equal to B, which we call B’, etc. Now
it is absurdly obvious that by using this
definition of “addition” and by using the
operations outlined above for obtaining
the relations of >, <, etc., an extensive
scale can be constructed for the magnitude
“objective number.” All of the
criteria of additivity can be met.

It is equally absurdly obvious that we 
do not have to place the objects in 
physical juxtaposition to reach the same 
result. We can show that ● + ● = ●●,
that ● + ● = ● + ●, etc. In fact if
some of the objects happen to be in
New York and others in London the
scale could still be constructed and the 
necessary criteria could be met. It is not 
physical juxtaposition that solves our 
problem, it is the simple fact that the 
operations that have been adopted allow 
us to meet the necessary criteria. 
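
The argument may be illustrated by a sketch in which two groups are "added" without being brought together. It is only an illustration: the lists stand in for groups of objects in two places, and equal_by_pairing is a hypothetical name for the pairing operation.

```python
# Illustrative sketch: the groups and place names are stand-ins, and
# 'equal_by_pairing' is a hypothetical helper name.
from itertools import chain, zip_longest

def equal_by_pairing(group_x, group_y):
    """Pair off object for object; equal if both are exhausted together."""
    missing = object()  # marks the side that ran out first
    return all(x is not missing and y is not missing
               for x, y in zip_longest(group_x, group_y, fillvalue=missing))

new_york = ["●"]          # group A, left where it is
london = ["●"]            # group A', left where it is
candidate_b = ["●", "●"]  # single group proposed as B

# "Addition" here only means treating the two groups as one group when
# pairing; nothing is moved into physical juxtaposition.
combined = chain(new_york, london)
print(equal_by_pairing(combined, candidate_b))
```

What does the work in the sketch is the rule for pairing, not the location of the objects, which is the point made in the text.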

It may immediately be argued: “It is 
unfair to think of physical juxtaposition 
in such a literal fashion. Does not 
Campbell say that it is possible to place 
one weight in the pan of the balance and 
hang the other weight underneath the 
same pan? It is not literal physical juxta- 
position that is demanded but it is an 
operation that allows the combined 
effect of the magnitude to be exerted in 
one direction. In the case of objective 
number it is true that the scale may be 
constructed if the objects or systems are
not literally placed side by side but it is
necessary for the two systems to be taken 
as a group, in other words, that the two 
groups B and B’ are paired off against 
the group C as if they were a unitary or 
single group.” 

Suppose this is so, what are the criteria 
for combining the systems? The above 
argument has not answered the problem 
but has restated it. In measuring length, 
why cannot one system be placed on two 
uprights and the other be hung beneath 
it? This would certainly be physical 
juxtaposition and would equally cer- 
tainly meet none of the necessary criteria. 

The answer can only be that the 
operation of placing one weight in
the pan and hanging the other under- 
neath the pan meets the necessary logical 
criteria while the operation of placing 
one length between two uprights and 
hanging the other underneath it does not 
meet the necessary criteria. When > is 
defined by the usual set of operations for 
constructing an ordinal scale of length, 
this operation for + would not meet 
the criteria A+ B > A’. 

The suspicion seems to be partially 
confirmed that the physicists have in- 
duced this extra, operational, criterion 
of physical juxtaposition because it has 
commonly occurred in the operations 
they have found it necessary to use in 
order to meet the criteria for additivity. 

It would indeed be curious if physical 
juxtaposition was found to be necessary 
for “addition” in the measurement of 
almost all magnitudes except objective 
number. 

It has been found in discussing this 
point that it was extremely difficult to 
find examples that do not appear 
facetious or absurd. In the example 
quoted above for the measurement of 
length it was asked why one object 
could not be placed on two uprights and
the other hung beneath it. This pro-
cedure seems absurd. It seems like a 
logical contradiction. It is obvious that 
if lengths are to be “added” they should 
be placed end to end. But is it so very 
obvious? The author feels that it is 
obvious only because it is such a well 
known, common, everyday experience. 
If one takes a less familiar example it 
does not appear so absurd. If the ques- 
tion were asked “How should I ‘add’ 
electrical resistance? Should I connect 
the resistance in series or in parallel?” 
the incorrect answer does not appear 
quite so unreasonable. 

If one then applied the same principle of physical juxtaposition to the measurement of inductance and juxtaposed the coils and connected them in series-aiding, or series-opposing, the result would not be so happy. The result of these operations for addition would yield $L_a = L_1 + L_2 + 2M$¹³ for the series-aiding case and $L_o = L_1 + L_2 - 2M$ for the series-opposing case. The strict interpretation of physical juxtaposition breaks down in this case. Yet we know that by other operations inductance may be measured fundamentally.
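The resistance and inductance cases can be checked numerically. The following is a minimal sketch only: the component values, the coupling value `m`, and all function names are assumptions introduced for illustration and do not come from the text. Each candidate operation for "addition" is treated as a function, and two questions are asked of it: does the combination exceed each of its parts (criterion VIII), and does it reproduce the sum of the separately measured values?

```python
# Illustrative sketch: candidate operations for "adding" electrical magnitudes.
# All numerical values and names are hypothetical.

def series_resistance(r1, r2):
    return r1 + r2

def parallel_resistance(r1, r2):
    return (r1 * r2) / (r1 + r2)

def series_inductance(l1, l2, m, aiding=True):
    # Coupled coils in series: L1 + L2 + 2M (aiding) or L1 + L2 - 2M (opposing).
    return l1 + l2 + (2 * m if aiding else -2 * m)

def exceeds_each_part(combine, a, b):
    """Criterion VIII: the combination must be greater than either part."""
    total = combine(a, b)
    return total > a and total > b

def is_additive(combine, a, b, tol=1e-9):
    """True when combining reproduces the sum of the separate measurements."""
    return abs(combine(a, b) - (a + b)) < tol

r1, r2 = 100.0, 220.0            # ohms (hypothetical)
l1, l2, m = 0.010, 0.010, 0.004  # henries (hypothetical)

candidates = {
    "resistance, series": (series_resistance, r1, r2),
    "resistance, parallel": (parallel_resistance, r1, r2),
    "inductance, series-aiding": (lambda a, b: series_inductance(a, b, m, True), l1, l2),
    "inductance, series-opposing": (lambda a, b: series_inductance(a, b, m, False), l1, l2),
}

for name, (combine, a, b) in candidates.items():
    print(name, exceeds_each_part(combine, a, b), is_additive(combine, a, b))
```

Under these assumed values only the series connection of resistances passes both checks. The parallel connection fails criterion VIII outright, and both juxtaposed-coil connections fail the sum check whenever the coupling is not zero.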

It has been said that A + B > A’ (VIII) must be demonstrated. Again it might be argued that it is obvious that the magnitude length, defined by the appropriate operations for obtaining the relations >, < and =, cannot be “added” by hanging one object under the other. VIII can obviously not be demonstrated. But it is this very obviousness that confounds the thinking concerning the underlying logic of the problem. Really the above method of “addition” should not be held to be obviously false until it has been experimentally demonstrated to be false. It is also obvious to the electrician that electrical resistance, to be “added,” must be placed in series, though it may be doubted whether this insight is inherited!

¹³ $L_a$ stands for the total inductance series-aiding and $L_o$ for the total inductance series-opposing. $L_1$ stands for the inductance of coil 1, $L_2$ stands for the inductance of coil 2, and 2M is the mutual inductance where $L_1 = L_2$.

This does not mean that the experi- 
menter may not save himself time, 
energy and embarrassment if he uses his 
intelligence and his past experience in 
selecting an operation for “addition” 
that has some hope for success. But it 
does mean that the final, in fact the only 
test, is an empirical one. Nothing but 
experiment can determine what opera- 
tion shall be “additive” for any given 
magnitude. 

It is now possible to attempt a definition of “addition.” Given a magnitude previously defined by appropriate operations for the establishment of the relations >, < and = between the systems with respect to this magnitude: “addition” is that operation, or series of operations, performed upon the systems in such a fashion that it is possible to meet the logical criteria for additivity (VIII, IX, X, XI).

It should be stressed that the same 
operations for demonstrating >, < and 
= in defining the magnitude (construct- 
ing the ordinal scale) must be used in 
demonstrating the criteria for additivity. 
For example, if = is defined in one way 
for the construction of the ordinal scale, 
the same definition must be used in cri- 
teria VIII, IX, X, XI, for additivity, etc. 
Furthermore, as Guild points out (13) 
the same definitions must be applicable 
for all parts of the same scale. There 
cannot be one definition for the smaller 
values of the magnitude and another for 
the larger. 





SECTION D. STEVENS’ POSITION¹⁴


Stevens defines three terms: numerals, numerousness and numerosity. He means by numeral a sign made on a piece of paper or an arbitrary symbol, which is the definition that has been adopted in this study. By numerousness is meant the property that “we discriminate when we regard a collection of objects.” In this sense numerousness might be called “subjective number.” “Numerosity is a property defined by certain operations performed on groups of objects.” By numerosity he means what has been called objective number in the preceding discussion. He uses the term numerosity instead of number because he wishes to emphasize the difference between numeral and number.


He begins by saying that it is possible to establish a rank order of groups of objects (beans, for example) in respect of numerousness¹⁵ simply by looking at the piles of beans and judging which of the piles is largest, etc.


“We know from experience, however, that greater reliability can be had if we rank-order the groups by pairing successively one bean from each group until one group is exhausted. Then if any beans remain in the other group, that group is said to have the greater numerosity.”¹⁶ “If the pairing exhausts both groups simultaneously, their numerosity is equal. . . . Now if we designate each of these piles by a separate sign, and if we decide to use the same sign to designate all groups showing the same numerosity, we find ourselves in possession of a series of numerals. The ‘spatial’ (topological) order in which we write the numerals depends on the degree of numerosity designated by each numeral. . . . When we regard the numeral series as originating in this fashion, we are not astonished to find that it exhibits certain important properties. The relations obtaining among groups, considered from the point of view of numerosity are reflected in the relations obtaining among numerals, but with an important difference: degree of numerosity among groups corresponds to ‘spatial’ relations in the numeral series. Likewise, the numerosity achieved by combining the groups (addition) corresponds to the numeral arrived at by stepping off two successive ‘distances’ along the numeral series, and since the order for combining the groups and for stepping off the ‘distances’ is immaterial, we can demonstrate the associative, commutative, and distributive laws both for groups of objects and for our series of numerals. Furthermore, to the extent that we can show transitivity, asymmetry, etc., among relations of numerosity, we can also show them among the topological ‘spatial’ relations in the numeral series. . . .” (40)

¹⁴ This summary of Stevens’ “position” is gathered from (38), (40), (42).

¹⁵ Subjective number.

¹⁶ Numerosity is here synonymous with what has in this paper been called objective number.
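The pairing operation that Stevens describes can be sketched in a few lines. This is an illustrative sketch under stated assumptions: the function name `compare_numerosity` and the sample groups are hypothetical, and the groups are represented as Python lists. The point is that the relations >, < and = are obtained by pairing off members alone, without counting and without any prior numeral series.

```python
# Illustrative sketch: rank-ordering two groups by pairing, with no counting.
# The function name and the sample groups are hypothetical.

def compare_numerosity(group_a, group_b):
    """Pair off one member from each group until one group is exhausted."""
    a, b = list(group_a), list(group_b)
    while a and b:
        a.pop()   # remove one bean from the first group
        b.pop()   # and its partner from the second group
    if a:
        return ">"   # beans remain in the first group
    if b:
        return "<"   # beans remain in the second group
    return "="       # both groups exhausted simultaneously

pile_x = ["bean"] * 3
pile_y = ["bean"] * 5

# Combining groups (addition) is simple pooling; the order of pooling
# makes no difference to the resulting numerosity.
print(compare_numerosity(pile_x, pile_y))
print(compare_numerosity(pile_x + pile_y, pile_y + pile_x))
```

The second comparison illustrates the remark in the quotation that the order of combining the groups is immaterial.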


Stevens then mentions three kinds of scales for numerousness.


1) An ordinal scale could be set up by 
arranging the groups of beans in an 
order so that every group was either 
greater or less or equal to every other 
group. 

2) An intensive scale could be devised 
by prescribing a semantical rule for de- 
termining the assignment of adjacent 
numerals; for example, adjacent nu- 
merals could be assigned to groups that 
showed a just noticeable difference in 
numerousness. 

3) An extensive or additive scale could be constructed by determining when one group appeared one half as numerous as another. “If this judgment could be made, we could then assign to the smaller group the numeral lying in the numeral series midway from the beginning of the series and the numeral assigned to the larger group. If we assigned numerals according to this procedure, the relations among groups exhibiting the property numerousness would be reflected in the spatial relations of numerals within the numeral series” (40).

Stevens (38), in 1936, discussing the problem of scaling in psychology, claimed that the purpose of scaling was to facilitate the description of natural phenomena by means of functional relations expressed, if possible, by the conventional mathematical symbols. In order to accomplish this it is desirable to assign numbers¹⁷ which denote not only the order of the systems but also the relative magnitude of the phenomena. “When this is done, the scale numbers can be manipulated in accordance with arithmetical laws in order to determine additional relationships such as the sum of two magnitudes, . . . etc. However, the outcome of the purely formal (mathematical) manipulation of the scale numbers has no significance unless the manipulations and their results can be identified with some concrete operations. First the scale numbers¹⁷ should be applied to the attribute of sensation in such a way as to make the scale one of true numerical magnitude, which means simply that if the numbers are manipulated according to the rules of arithmetic, the result (and the manipulations) correspond to a set of physical operations. Secondly, although at the outset we could conceivably choose any one of several sets of operations as defining the scale, that set will ultimately prove to be most satisfactory for a subjective scale when it leads to scale numbers bearing a reasonable relationship to the experience of the observer.” He continues by saying that a scale would be satisfactory if the magnitude¹⁸ of a particular discriminable characteristic to which the numeral 10 had been assigned was half as great subjectively as that to which the numeral 20 was given and twice as great as that to which the numeral 5 was given. “With such a scale the operation of addition consists of changing the stimulus until the observer gives a particular response which indicates that a given relation of magnitude has been achieved.” He says again, “A scale, then, which would enable us to designate the numerical as well as the intensive magnitude of an attribute of sensation can be constructed according to the criterion that, having assigned a particular number N to a given magnitude, the number N/2 shall be assigned to the magnitude which appears half as great to the experiencing individual.”

¹⁷ What we have here called numerals.

¹⁸ I.e., subjective magnitude.

The actual method of assigning numerals in order to construct a scale of this sort is somewhat more confusing than the above statement would indicate. As this method of scaling will be referred to fairly frequently in the subsequent discussion, it will be outlined here in some detail. Other accounts may be found in (39), (40), (42).

In a typical case the subject is presented with a standard stimulus and adjusts a variable stimulus until it appears to be half the subjective magnitude of the standard. These 1/2 judgments are obtained for a number of standards which have been selected to cover a wide range of physical magnitudes. It is then possible to plot the stimuli judged 1/2 against the standard stimuli. The units used in this plot are physical measures of the stimuli, such as frequency, centimeters, etc. A curve is then fitted to the obtained points. This plot is referred to hereafter as the half judgment function. Figure 1 shows such a plot for a group of imaginary data. In this figure the stimuli judged 1/2 are plotted, in physical units, against their respective standards, also measured in physical units.

[Figure 1: ordinate, JUDGED ONE HALF (physical units); abscissa, STANDARD (physical units).]

FIG. 1. Hypothetical half judgment function.

The magnitude function or scale is
constructed from this plot in the following way:

1) A numeral is arbitrarily chosen and assigned to some subjective magnitude associated with some arbitrarily selected stimulus. In this case the numeral 1 has been arbitrarily chosen and has been arbitrarily assigned to represent the magnitude of the discriminable characteristic associated with the stimulus physical magnitude of 1. This fixes the first point of the magnitude function (Fig. 2), indicated by a cross.

2) Returning to Figure 1 it is seen 
that a stimulus of 1 physical unit was 
judged by the subject to be 1/2 the 
subjective magnitude of a stimulus of 
1.75 physical units. In other words the
magnitude of the discriminable charac- 
teristic associated with a stimulus of 1 
physical unit is judged to be 1/2 as 
great as the magnitude of the discrimi- 
nable characteristic associated with a 
stimulus of 1.75 physical units. 

Since the magnitude of the discrimi- 
nable characteristic associated with a 
stimulus of 1 physical unit is judged to 
be 1/2 the magnitude of that associated 
with one of 1.75, and since the magni- 
tude of the discriminable characteristic 
associated with a stimulus of 1 has been 
assigned the numeral 1, then the nu- 
meral 2 (2 subjective units) is assigned 
to the magnitude of the discriminable 
characteristic associated with a stimulus 
of 1.75 physical units. This point is then 
plotted. (Point A, Fig. 2). 

3) Returning to Figure 1, it is neces- 
sary to find of what discriminable char- 
acteristic the discriminable character- 
istic associated with the stimulus of 1.75 
physical units is judged to be 1/2. It is 


seen that the discriminable character- 
istic associated with the stimulus of 1.75 
physical units is judged to be 1/2 of the 
subjective magnitude of the discrimi- 
nable characteristic associated with a 
stimulus of 2.85 physical units. 


[Figure 2: ordinate, SUBJECTIVE UNITS; abscissa, PHYSICAL UNITS.]

FIG. 2. The magnitude function constructed from the hypothetical half judgment function presented in Figure 1 (arithmetic coordinates).


Since the magnitude of the discrimi- 
nable characteristic associated with a 
stimulus of 1.75 physical units has been 
judged to be 1/2 the magnitude of one 
of 2.85 physical units, the numeral 
assigned to the stimulus of 2.85 units 
must be twice the numeral assigned to 
represent the magnitude of the dis- 
criminable characteristic associated with 
a stimulus of 1.75 physical units. Two
subjective units (the numeral 2) have
been assigned to the magnitude of the
discriminable aspect associated with a 
stimulus of 1.75 physical units so 4 sub- 
jective units are assigned to represent 
the magnitude of the discriminable 
aspect associated with a stimulus of 2.85 
physical units, point B in Figure 2. This 
process is continued for points C, D, 
etc., until the scale is complete. 
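The construction in steps 1 to 3 can be sketched as a short procedure. This is a minimal sketch under stated assumptions: the three data pairs are the hypothetical readings of Figure 1 quoted in the text, while the function names and the use of straight-line interpolation in place of a fitted smooth curve are choices made here for illustration only.

```python
# Illustrative sketch of building the magnitude function from the half
# judgment function. Data pairs are the hypothetical readings quoted in the
# text: (standard, stimulus judged one half), both in physical units.
HALF_JUDGMENTS = [(1.75, 1.0), (2.85, 1.75), (4.6, 2.85)]

def interpolate(points, x):
    """Piecewise-linear interpolation through (x, y) points sorted by x."""
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        if x0 <= x <= x1:
            t = (x - x0) / (x1 - x0)
            return y0 * (1 - t) + y1 * t
    raise ValueError("outside the explored range; no extrapolation")

def standard_for_half(half_value):
    """Find the standard of which `half_value` was judged to be one half."""
    inverse = sorted((half, std) for std, half in HALF_JUDGMENTS)
    return interpolate(inverse, half_value)

def build_scale(start_stimulus=1.0, start_numeral=1.0):
    """Step upward from the starting point, doubling the numeral each time."""
    scale = [(start_stimulus, start_numeral)]
    stimulus, numeral = start_stimulus, start_numeral
    while True:
        try:
            stimulus = standard_for_half(stimulus)
        except ValueError:
            break   # the explored stimulus range has been exhausted
        numeral *= 2
        scale.append((stimulus, numeral))
    return scale

for stimulus, numeral in build_scale():
    print(stimulus, "physical units ->", numeral, "subjective units")
```

With the quoted readings the procedure assigns 1, 2, 4 and 8 subjective units to stimuli of 1, 1.75, 2.85 and 4.6 physical units, and it stops rather than extrapolating beyond the explored range.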

There are several important things to 
note: 

1) A smooth curve is always obtained 
in the magnitude function. This is neces- 
sarily so because it is constructed from a 
smooth curve fitted to the data in the 
half judgment function. 

2) The magnitude plot and the half 
judgment function should be extrapo- 
lated only with extreme caution. In 
fact, if only a limited section of the 
stimulus range has been explored, they 
should not be extrapolated at all. The 
reason for this statement, based on the 
author’s limited experience with these 
plots, is that the half judgment function 
usually changes slope as the upper and 
lower thresholds are approached. 

3) Interpolation is obviously neces- 
sary, as it would be impossible to obtain 
1/2 judgments for every stimulus value. 
Values closer together than 1 j.n.d. 
would not add anything to the accuracy 
of the graph. 

There can be no rule for determining 
the number of stimuli used. Common 
sense can be the only judge. It should 
be clear, however, that the stimuli should 
be closer together at critical points on 
the curve. Critical points would be those 
where the curve changes slope rapidly 
or near the point of break in a discontinuous
function.

4) In the example of scale construction used above, the arbitrary starting point was the lowest stimulus value. The magnitude plot could have been obtained by arbitrarily starting at the highest stimulus value or even by starting with a stimulus value that is in the middle of the stimulus range. This latter procedure was adopted by Stevens and Volkmann (42).

If the arbitrary starting point is not at 
the bottom of the stimulus range a 
slightly different procedure must be 
adopted in constructing the magnitude 
plot. 

Suppose a stimulus of 4.60 physical 
units had been chosen for the arbitrary 
starting point, then: 

1) A numeral is arbitrarily chosen, as 
before, and assigned to the magnitude 
of the discriminable characteristic asso- 
ciated with a stimulus of 4.6 physical 
units. Assume that the numeral 8 has 
been assigned to the stimulus of 4.6. 

All the numerals for stimuli above 4.6 
physical units are obtained as outlined 
above. Numerals for stimuli that are 
lower than 4.6 physical units are ob- 
tained by asking the question, “What 
stimulus was judged to be 1/2 of 4.6?” 
To answer this question it is necessary 
to go to Figure 1 and go out the abscissa 
to 4.6 and up the ordinate to the value 
of the stimulus judged 1/2 which in this 
case is 2.85. 

Since the discriminable characteristic 
associated with a stimulus of 4.60 physi- 
cal units was assigned 8 subjective units, 
then the magnitude of the discriminable 
characteristic associated with 2.85 should 
be assigned 4 subjective units. 

The question is then asked, “What stimulus was judged to be 1/2 of the stimulus of 2.85 physical units?” and the scale is continued in this fashion until complete.

In this example, where the numeral 8 is assigned to the stimulus 4.6, the resulting scale would be identical with the scale constructed previously with the arbitrary starting point at the bottom of the stimulus range, i.e., with the numeral 1 assigned to the discriminable characteristic associated with the stimulus of 1 physical unit. The reason for this is that the numeral 8 is the numeral that would have been assigned to the stimulus 4.6 if the scale had been started from the bottom (stimulus 1) with an arbitrarily chosen numeral of 1. If any other numeral than 8 is assigned to the stimulus 4.6 the two scales will not be identical. However, the relations between the stimuli as expressed by the scale numerals will be the same no matter what the arbitrary starting point or what numeral is assigned to it.
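The effect of the arbitrary starting numeral can be illustrated with the same hypothetical readings. In the sketch below the dictionary `JUDGED_HALF` holds the three half judgments quoted in the text; the function name and the alternative starting numeral of 20 are assumptions chosen only for illustration.

```python
# Illustrative sketch: working downward from a starting point of 4.6
# physical units. Each standard maps to the stimulus judged one half of it
# (hypothetical readings of Figure 1 quoted in the text).
JUDGED_HALF = {4.6: 2.85, 2.85: 1.75, 1.75: 1.0}

def scale_downward(start_stimulus, start_numeral):
    """Halve the numeral at each step down the half judgment function."""
    scale = {start_stimulus: start_numeral}
    stimulus, numeral = start_stimulus, start_numeral
    while stimulus in JUDGED_HALF:
        stimulus = JUDGED_HALF[stimulus]
        numeral = numeral / 2
        scale[stimulus] = numeral
    return scale

def relative_to_top(scale, top=4.6):
    """Express every numeral as a ratio to the numeral of the top stimulus."""
    return {stimulus: value / scale[top] for stimulus, value in scale.items()}

scale_8 = scale_downward(4.6, 8)     # reproduces the scale started from 1
scale_20 = scale_downward(4.6, 20)   # a different arbitrary numeral

print(scale_8)
print(scale_20)
print(relative_to_top(scale_8) == relative_to_top(scale_20))
```

The two scales assign different numerals to every stimulus, yet the ratios between the numerals agree, which is the sense in which the relations between the stimuli are unaffected by the choice of starting numeral.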

There is another set of operations used 
for constructing a magnitude function in 
psychology when the data have been ob- 
tained by the method of equal appearing 
intervals. The method of equal appear- 
ing intervals itself is too well known to 
require description here. But the tech- 
nique of construction of the function 
after the data have been obtained dif- 
fers from the method used for fractiona- 
tion data. Although mention will be 
made of the method of equal appearing 
intervals in the present paper, no use 
will be made of the technique for the 
construction of the magnitude function 
with data obtained from the method. 
The reader is referred to Stevens and
Volkmann (42) where a good description
of the method may be found.

Stevens and Volkmann (42) argue 
that the results obtained by this method 
should check the results obtained by the 
method of fractionation. This of course 
does not mean that the values (nu- 
merals or units of subjective magnitude) 
assigned to any stimulus by the scaling 
technique employed after the use of the 
method of fractionation should be 
identical with the values assigned to the 


same stimulus after the use of the method 
of equal appearing intervals. It does 
mean that the relations between stimuli 
as expressed by one set of numerals 
should be the same as the relations be- 
tween stimuli as expressed by the other 
set of numerals. 


SECTION E. CRITICISMS OF PSYCHOLOGICAL MEASUREMENT BY THE PHYSICISTS


There are four basic methods by 
which measurement has been attempted 
in psychology, the method involving the 
integration of j.n.d.’s, the method of 
equal appearing intervals, the method of 
fractionation and the various techniques 
based on the normal probability curve. 

The objections and criticisms of the physicists will be briefly outlined for each of these methods in turn:¹⁹


1) The Integration of j.n.d.’s 


a) The scale constructed from the integration of j.n.d.’s cannot measure an A-magnitude, as the defining relations involve the measurement of another magnitude, namely stimulus intensity. The defining relations for an A-magnitude must be independent of all other quantitative relations for other magnitudes. Furthermore the only way in which the scale could measure a B-magnitude is to “define S by a postulated relation to I.” The scale has some of the properties of both A and B magnitudes but has neither the necessary nor the sufficient properties of either (13).

b) Fechner assumed that all j.n.d.’s are equal. What are the actual specified relations between j.n.d.’s? $\Delta S_1$ is the sensation increment associated with a j.n.d. at stimulus intensity $I_1$ and $\Delta S_2$ is the sensation increment associated with the j.n.d. at intensity $I_2$. This statement is the only specified relation between $\Delta S_1$ and $\Delta S_2$. This relation is not symmetrical, as it ceases to be true when $\Delta S_1$ and $\Delta S_2$ are interchanged. But to establish equality the relation must be shown to be symmetrical. “This one consideration alone renders superfluous all the semimetaphysical arguments which have centered round the question whether or not equal, in the sense of equally noticeable, necessarily means ‘really’ equal. A symmetrical transitive relation is essential as a practical criterion of equality in measurement” (19).

¹⁹ This summary of the physicists’ criticism has been taken chiefly from (7), (13), (14).

c) The criterion of equality is not applicable throughout the scale. It is meaningless to say that
$S = \Delta S_1 + \Delta S_2$
because = in this case has a meaning different from that used for equality of different $\Delta S$’s (13).

There seems to be no reasonable de- 
fense against these arguments. 


2) The Method of Equal Appearing Intervals


The chief arguments presented by the 
physicists against the method of equal 
appearing intervals are: 

a) The magnitude obtained by the 
method of equal sense distances is not 
the same magnitude as sensation inten- 
sity. The operations for finding equal 
sense distances involve 3 or more stimuli 
of different apparent intensities while 
those for finding equality of sensation 
intensity involve two apparently equal 
stimuli. The operations for obtaining
equality are different in the two cases, 
therefore the magnitudes are also dif- 
ferent (19). 

b) Nor, they argue, can it be stated 
in rebuttal that the above objection 
applies with equal validity to any magnitude.
While it is true that it is impossible
to obtain equal differences of
length without at least 3 objects of un- 
equal length, the analogy is not valid. 
As Guild says, “Difference of lengths as 
something expressible on a quantitative 
scale derives its significance from the 
association of number and length established
by the practical criteria of
equality and addition which define 
length as a magnitude. It merely means 
the length which must be added to the 
smaller of two lengths in order to make 
a new length equal to the larger of the 
original pair. We cannot define a process 
of subtraction independently of a proc- 
ess of addition. We cannot construct a 
scale of length from units of difference- 
of-length defined by operations other 
than those involved in defining equality 
and addition for length. Similarly we 
cannot give any quantitative significance 
to difference-of-sensation-intensity unless 
we already have practical criteria both of 
equality and addition for sensation in- 
tensity; for all that difference-of-sensa- 
tion-intensity means, if it means any- 
thing, is the sensation intensity which, 
when added to the smaller of two given 
sensation intensities, will produce a new 
intensity equal to the larger” (19). 

c) The proposed criterion of equality 
is inadequate, as a symmetrical, transi- 
tive relation cannot be demonstrated. 

This last objection seems to be decidedly ill-founded. Given the four stimuli A, B, C, D, marking off the three sense distances, AB, BC, and CD,

[Diagram: the points A, B, C, D marked off along a line.]

and given the fact that BC and CD have been judged equal, it can be shown, obviously, that $BC \not> CD$ and $BC \not< CD$ (III); also if BC is judged to be > AB, it is more than likely that CD will be judged > AB (IV). Given the case below where BC and CD had also been judged as equal,

[Diagram: the points A, B, C, D marked off along a line.]

it is more than likely that if BC is judged < AB, CD will also be judged < AB (V).
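The checks applied in the passage above can be written out as a small test of a set of judgments. The sketch is illustrative only: the subjective sizes in `SENSE_DISTANCE` are invented, `judge` merely stands in for an observer's comparative judgment, and the three checks follow the way criteria III, IV and V are applied in this passage rather than their formal statements, which are not reproduced here.

```python
# Illustrative sketch: testing a judged equality of sense distances.
# The subjective sizes below are hypothetical stand-ins for an observer.
SENSE_DISTANCE = {"AB": 3.0, "BC": 5.0, "CD": 5.0}

def judge(x, y):
    """Stand-in for the observer's comparative judgment of two intervals."""
    dx, dy = SENSE_DISTANCE[x], SENSE_DISTANCE[y]
    if dx > dy:
        return ">"
    if dx < dy:
        return "<"
    return "="

def equality_holds(x, y, others):
    """Check x = y in the manner of criteria III, IV and V as applied above."""
    # III: neither interval is judged greater or less than the other,
    # whichever is presented first.
    if judge(x, y) != "=" or judge(y, x) != "=":
        return False
    # IV and V: whatever x is judged greater (or less) than, y must be too.
    return all(judge(x, z) == judge(y, z) for z in others)

print(equality_holds("BC", "CD", ["AB"]))
print(equality_holds("AB", "BC", ["CD"]))
```

In an actual experiment `judge` would be replaced by the recorded responses, and the question of interest is whether the checks hold with acceptable consistency over repeated trials.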

The author believes that two questions 
are raised by objection b. The first ques- 
tion is whether the observer can actually 
equate sense distances. The second ques- 
tion concerns the interpretation to be 
placed on the results if he can equate 
them. 

What is the answer to the first ques- 
tion? How do we know that the ob- 
server can equate sense distances? Can 
this equation have any meaning apart 
from the operations used to define addi- 
tion? It can, for the operations for ob- 
taining the relation = are certainly not 
dependent upon the operations for de- 
fining addition. The criteria for = are 
mentioned under III, IV and V. These 
criteria may be shown to hold for the 
relation of = established by the method 
of equal sense distances. There can be 
no possible objection to the statement 
that the sense distances are equal, for 
the established equality meets all the 
necessary logical criteria for equality. 

Why, then, is objection b raised at
all? The difficulty seems to be with the 
word “difference.” The physicists argue 
that a difference may not be defined 
apart from addition, i.e., the difference 
between A and B is that amount that 
must be added to B to produce A. In 
other words if B + X = A, then
A - B = X, but they argue that this
second proposition has no meaning apart 
from the first proposition. Since ad- 
ditivity has not been demonstrated, the 
concept of “difference” has no meaning 
in the equal sense distance experiment. 


It seems to the author that this criticism is due to a certain amount of verbal confusion. It is true that the word difference is sometimes used to describe the sense distances equated by the observer, but this does not imply that the observer need perform two mental subtractions and additions in making his judgment. It is possible to call the interval whatever one wishes but this does not change the fact that the observer has the straightforward task of equating the intervals,

[Diagram: the stimuli A, B and C.]

AB and BC, so that AB = BC.²⁰ He is perfectly capable of making this judgment with some consistency and the resulting equation satisfies all the criteria for equality.

But even if it is granted, and the author believes it must be, that the observer can make this judgment of equality in such a way that all of the criteria for equality are satisfied, not all of the physicists’ objections are answered.

The psychologist is not interested in the simple result AB = BC. This result is but a means to an end. He desires to construct a scale; he wants to be able to say that if A = 0 (the absolute threshold) and the numeral 1 is assigned to B, the numeral 2 must be assigned to C.

[Diagram: the stimuli A, B and C.]

He reasons that this is possible because C is as “far” from B as B is from A. But the physicist would argue that the numerals so assigned cannot be interpreted in the conventional fashion unless it is shown that AB + BC = AC.

²⁰ This is not a new viewpoint. It seems to the author that it is essentially the same as Delboeuf’s Contrastes Sensibles.

This objection seems far-fetched because B was so adjusted that AB = BC and it should follow logically that B bisects AC and as a consequence, since two halves make a whole, AB + BC must equal AC. But the criteria for measurement are not only logical, they are practical. The statement that AB = BC, therefore AB = 1/2 AC, and therefore AB + BC = AC, is a sound logical deduction but it may not correspond to any demonstrable relation between the stimuli in question. Although it is logical that the two halves make up the whole, it may not be experimentally demonstrable in a given situation. For example, suppose the introduction of the stimulus B increased the subjective magnitude of the distance AC. If the numeral 1 was assigned to B and the numeral 2 was assigned to C, the magnitude of C presented alone would be misrepresented by the numeral 2 assigned to it, e.g., suppose the observer is asked to adjust a stimulus B so that AB = BC when the stimulus A has 0 objective magnitude.

[Diagram: the stimuli A and C.]

The line C has associated with its physical length a subjective length which may be called X.

Now suppose the observer begins to draw the line B which has a subjective length Y associated with it.

[Diagram: the stimuli A, B and C.]

The observer is to make the equation OY = YX. The subjective magnitude of C may be increased because of a contrast effect. This new subjective magnitude of C may be called X + e.

Now when the observer adjusts B so that OY = YX he will be actually adjusting B so that OY = Y(X + e). If the numeral 1 is now assigned to Y, the numeral 2 cannot be assigned to the subjective magnitude of C, i.e., to X. It must be assigned to X + e.

It cannot be assumed that subjectively AB + BC = AC simply because AB = BC. It must be demonstrated.
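A numerical illustration may make the contrast-effect argument concrete. The values of X and e below are invented, and the function name is hypothetical; the sketch assumes only what the example states, namely that the observer bisects the magnitude of C as it appears in the presence of B.

```python
# Illustrative sketch of the contrast-effect example. All values are
# hypothetical; only the structure of the argument comes from the text.

def bisect_with_contrast(x_alone, contrast_increment):
    """Return (Y, 2Y) when the observer bisects C while B is present."""
    x_in_context = x_alone + contrast_increment   # C now appears as X + e
    y = x_in_context / 2                          # B adjusted to the midpoint
    return y, 2 * y

x_alone = 10.0   # subjective length X of line C presented alone
e = 1.0          # increase in C produced by the presence of B

y, magnitude_for_numeral_2 = bisect_with_contrast(x_alone, e)

print("numeral 1 stands for", y)
print("numeral 2 stands for", magnitude_for_numeral_2)
print("C presented alone is", x_alone)
```

With these assumed values the numeral 2 represents a magnitude of 11, whereas C presented alone has a magnitude of 10. Only when e is zero do the two coincide, and whether e is zero is an empirical matter.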

Objection a might be answered empirically. It was seen that properties that have the same order are considered to be magnitudes of the same kind. If, then, it can be shown that the order resulting from one series of operations for obtaining >, < and = is the same as the order resulting from another definition of >, < and =, it can be said that the two sets of operations measure magnitudes of the same kind. It can be confidently predicted that, for most subjective magnitudes, the results of an empirical investigation would show objection a to be ill-founded.


3) Fractionation 

a) The first objection to the method of fractionation is similar to objection b raised against the method of equal appearing intervals. It is not possible to define 1/2 without reference to addition. The observer should not be entitled to use the word 1/2, if the facts from which the word derives its significance do not exist in the sphere of discourse (14).

Furthermore, granting that the observer can adjust B = 1/2 C, he has not demonstrated that B + B’ = C. This can only be done if an operation for addition that meets all the necessary criteria has been found.

b) Campbell says that the monaural-binaural method of scaling loudness satisfies the criterion of additivity (7, 14). The only reason for comparing the scale obtained by fractionation with the scale obtained by this method²¹ is to see whether the observer can guess 1/2 correctly. Even if the observer could guess 1/2 correctly, no one would abandon the exact physical measurement of magnitudes in favor of measurement based on guesses.

As in objection b raised against the 
method of equal appearing intervals, 
there are two points at issue in objec- 
tion a. 

The first is, can there be any meaning
to the operation of halving unless
additivity has been shown? The second
question is, assuming that the 1/2 judgment
has a meaning apart from addition, do
the assigned numerals give an extensive
scale?

To answer the second question first,
it would seem that it is necessary to
remember that the assigned numerals
cannot express any relation that has not
been demonstrated empirically. Stevens
says, “If we assigned numerals according
to this procedure,† the relations among
groups exhibiting numerousness‡ would
be reflected in the ‘spatial’ relations of
numerals within the numeral series.”
This is true. But the relation B + B’ =
C is not one of the relations demonstrated
by the fractionation experiment.
By the very nature of the experiment the
only relation claimed to be demonstrated
is B = 1/2 C.

* He is discussing the comparison made in
Stevens and Davis (39a).

† I.e., the procedure used after a fractionation
experiment.

‡ I.e., subjective number.

Certainly no objection can be raised
to Stevens’ theoretical position concerning
the origin of the numeral series and
the criteria that must be met to be able
to obtain an additive scale. Nor can
there be any objection to the statement
quoted immediately above. The only
question at issue is whether the scale
constructed by the method used by him
can be accepted as additive when the
additive nature of the magnitude has not
been demonstrated. The magnitude may
be additive, but it has not been shown
to be so.

In answer to the first question, it is 
necessary to note that there are certain 
similarities and differences between the 
method of equal appearing intervals and 
the method of fractionation. In order 
to examine these it might be helpful 
to diagram the method of equal appear- 
ing intervals in a way different from 
that used above when stimulus A = O. 


O ----- A

O ---------- B

O --------------- C

This diagram gives a somewhat clearer
picture of the true situation. These are
three stimulus intensities; one, A, has
the absolute magnitude represented by
the distance from zero (the absolute
threshold); the second, B, has a greater
absolute magnitude, represented by the
distance from zero to B; and the third,
C, a still greater absolute magnitude,
represented by the distance from zero to
C.

The fractionation experiment may be
diagrammed as follows:


O -------- B

O ---------------- C








The important difference to note is
that in the fractionation experiment the
stimulus A is zero. Aside from this
difference it is immediately apparent that
the two situations are strikingly similar.
In fact it may be said that the stimulus 
situation in the fractionation experi- 
ment is a special case of the equal ap- 
pearing intervals experiment. 

What is stimulus A which is zero? It 
simply means that no stimulus is given, 
it being assumed that the observer has 
some idea of what zero magnitude is 
like. Stevens and Volkmann (42), in a 
successful attempt to improve the relia- 
bility of their results for the pitch func- 
tion, introduced a stimulus of zero pitch. 
It might seem that the presentation of 
a stimulus of zero magnitude must mean 
the presentation of no stimulus at all. 
This is not necessarily true. The experi- 
menter does not present a stimulus of 
zero physical magnitude; he presents a 
stimulus of such an intensity that the 
characteristic being scaled has zero sub- 
jective magnitude. There may be other 
characteristics of the stimulus that have 
a subjective magnitude above zero. But 
still it would seem that, in effect, the 
experimenter was presenting a charac- 
teristic that was subjectively non-exist- 
ent; hence, so far as that characteristic 
was concerned, he might just as well 
have presented no stimulus at all. The 
observer is no better off than if he had 
supplied his own idea of zero. This is 
true. In practice, then, the zero stimulus
is not really a zero stimulus but one
that is just noticeably above zero. This
procedure was followed by Stevens and
Volkmann (42) and was found to be
helpful for some of the observers.

It should be noted that the “almost 
zero” stimulus was not presented to the 
observer in the usual way. In reality it 
was available for him to use if he needed 
it, in order to clarify his idea of what 
zero was like. He could turn it on or off 
at will. 

It may be argued that this procedure
of allowing the observer to hear a zero
characteristic that is not really zero will
introduce a constant error into the
results. Stevens and Volkmann recognize
this possibility but argue that the effect
of this error would be negligible for the
high tones, and the subject was not
allowed to use this “almost zero” stimulus
for the fractionation of the lowest
tone. Furthermore, when the almost zero
stimulus is used, the resulting increase
in accuracy will undoubtedly more than
compensate for any small deviation
caused by its introduction. Also the
possibility of a constant error will be
reduced progressively as the “almost zero”
stimulus approaches the absolute
threshold.

Stevens and Volkmann (42) point out 
the possibility that the introduction of 
a zero stimulus may turn the fractiona- 
tion experiment into an equal appear- 
ing intervals experiment. They reason 
that logically the fractionation to one- 
half is equivalent to the bisection of 
some higher point and zero. They say, 
however, that in their experiment the 
fractionation procedure differed in two 
important respects from the equal ap- 
pearing intervals experiment. The first 
difference is that the observer was not 
given the A stimulus but it was merely 
available if he wished to use it; the sec- 
ond difference is that the observer was 
instructed to use the “almost zero” tone 
(A stimulus) as a reference tone and not 
“as a limiting tone in a self imposed task
of bisection” (42). The second difference
would seem to be the more important,
for it seems that even if there is no A 
stimulus presented at all, the subject 
may still make an equal appearing inter- 
val experiment out of the fractionation 
experiment by self-instruction. It is true 
that what the experimenter does is im- 
portant but it also is true that what the 
subject does is very often more impor- 
tant. Even if the experimenter does not 
allow the observer to use an A stimulus, 
the observer may still equate the inter- 
vals AB and BC. He can supply his own 
A stimulus. It must always be remem- 
bered that there are always two kinds of 
operations in an experiment of this sort, 
those that are under the direct control 
of the experimenter and those that are 
under the direct control of the observer. 
No matter what the experimenter does, 
the operations of the observer can often 
determine the nature of the experiment. 
It can, in fact, be stated that the most 
important differences between these two 
methods are those that depend on the 
operations not under the direct control 
of the experimenter. 

It will be worth while to examine the 
difference between the two experiments 
in more detail. 

In the equal appearing intervals
experiment the observer must:

1) Disregard the absolute magnitude
of the stimuli.

2) Equate the distances AB and BC.

The question as to whether the ob- 
server can halve or bisect the distance 
AC is the same question as whether he 
can halve the stimulus in the fractiona- 
tion experiment. 

In the fractionation experiment the 
observer might do one of three things: 

1) He might equate the distances
AB (when A = O) and BC.

2) He might set B to one-half the
absolute magnitude of C (this is what he
is told to do in the instructions).

3) He might adjust the absolute
magnitude of B so that B + B = C.

If the observer chooses the first of 
these possibilities he automatically per- 
forms the operations inherent in the 
method of equal appearing intervals. 

With respect to the second possibility,
can the observer find 1/2 the absolute
magnitude of C? This is essentially the
question raised in objection a.

The author believes that the observer
can do this in only two ways. The first
way is to define 1/2 C as that position of
B when AB = BC. In other words, the
observer can perform this task when he
changes it into an equal appearing
intervals task. The second way is to define
1/2 as that position of B that allows the
absolute magnitude of B added to itself
to equal C, that is, B + B = C.
This is the third operation mentioned
above. It seems, then, that the
concept of 1/2 can have no meaning apart
from these operations. But the physicists
would argue that it has no meaning
other than that which is dependent on
addition.

It is, then, necessary to examine the
fractionation method of obtaining half
to see if the 1/2 so defined is independent
of addition.

It is true that the observer might de- 
fine half as that position of B that di- 
vides the distance AC into halves. But 
what can the observer actually do to 
obtain this relation? All he can do is to 
equate AB and BC. He makes judg- 
ments of “difference” and “no differ- 
ence” on the AB and BC intervals. This 
has nothing to do with “halving.” It is 
the operation for obtaining equality. But
where then does the half concept arise?
It arises because the observer knows that
a point that bisects a distance logically
divides that distance into two halves.
But not only is a half one of those two
equal parts into which a whole may be
divided, it is also one of the two equal
parts which actually add up to make the
whole. And the argument is right back
at the beginning. The concept 1/2 cannot
be defined apart from addition. It is
true that the observer can make the
logical deduction that if the whole is
divided into two equal parts the two parts
must be equal to the whole. Again,
however, we are up against the fact that the
criteria for measurement are practical;
they are not assumptions. The relations
between the systems must be demonstrated
facts. The statement that B =
1/2 C is meaningless unless it can be
demonstrated that B + B = C.
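
The gap between equating intervals and demonstrating B + B = C can be made concrete with a toy calculation. Everything in it is assumed for illustration: the logarithmic function `subjective` is a hypothetical relation between stimulus and subjective magnitude, and simple physical combination of the stimulus with itself is taken as the candidate operation of addition. Neither is asserted of any real observer.

```python
import math


def subjective(x):
    # Hypothetical subjective magnitude of a stimulus of physical size x.
    # The logarithmic form is an assumption made only for this sketch.
    return math.log10(1 + x)


def bisect_from_zero(c, steps=60):
    # Find the stimulus B for which the interval O-B appears equal to the
    # interval B-C, i.e. subjective(B) - 0 == subjective(C) - subjective(B).
    lo, hi = 0.0, c
    for _ in range(steps):
        mid = (lo + hi) / 2
        if subjective(mid) < subjective(c) - subjective(mid):
            lo = mid  # B is still too small
        else:
            hi = mid
    return (lo + hi) / 2


C = 99.0
B = bisect_from_zero(C)

print(round(B, 6))                       # the stimulus that bisects O-C
print(subjective(B + B), subjective(C))  # candidate "B + B" compared with C
```

Under these assumptions the observer's setting equates the two intervals exactly, yet the candidate operation applied to B and B falls well short of C. The equality AB = BC has been obtained; the relation B + B = C has not.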

The only conclusion that can be
drawn, then, is that the fractionation
experiment becomes an equal appearing
intervals experiment if the observer
defines 1/2 as that position of B which
makes AB = BC. The results must be
interpreted in the same way as those
obtained by the equal sense distance
method and they are subject to the same
limitations.

Suppose the observer defines 1/2 as that
magnitude of B which fulfills the
condition that B + B = C. It must be noted
that this definition can only be made
with reference to the absolute subjective
magnitude of the stimulus B. If the
absolute magnitude of B added to itself
equals C, then one can say that B = 1/2
C. It is impossible to define 1/2 by adding
the sense distances rather than the
absolute magnitudes, for if AB + BC = AC,
it cannot be deduced that B bisects the
distance AC unless it has previously been
shown that AB = BC. There is some
evidence that this definition may be used.
One observer has reported that he has
used this definition of 1/2 or, rather,
it is truer to say that he used both
definitions of 1/2. This observer reported
that he checked one definition against
the other, i.e., he would equate the
distances AB and BC and then to check
the equation would ask himself the
question, “Does the absolute magnitude
of B + B = C?” It may be that some
observers have as much difficulty defining
1/2 apart from addition as the logicians
think they should have.

The fact that even one observer has
reported a subjective additive operation
is extremely interesting. In so far as any
observer uses the additive definition of
1/2 the objection that 1/2 C = B does not
mean that B + B = C loses its validity.
This does not mean that the fractionation
method gives us an additive scale.
To obtain an additive scale it is necessary
that the subjective operation of addition
used by the observer meet the
necessary criteria and, furthermore, it is
of course necessary to be certain that the
observer actually does use the additive
definition of 1/2 and not the equality
definition. Although an additive scale has
not yet been demonstrated, the way
seems to be open by which the psychologist
may attempt to demonstrate additivity
for the magnitudes that he “measures.”
He is free of the false criterion of
“physical juxtaposition” and may go
about his business of attempting to find
those operations that will meet the
logical criteria for measurement.
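
What it would mean for a proposed operation to "meet the necessary criteria" can be sketched as a finite check. The sketch is hedged in several ways: the function name `meets_addition_criteria` is hypothetical, only three criteria are tested (independence of order, independence of grouping, and the law that adding a magnitude greater than zero gives a greater result), and plain numbers stand in for what would in practice be the observer's own judgments of greater, less and equal.

```python
from itertools import product


def meets_addition_criteria(combine, magnitudes, tol=1e-9):
    """Test a candidate operation against three criteria on a finite sample."""
    for a, b in product(magnitudes, repeat=2):
        # The result must not depend on the order of combination.
        if abs(combine(a, b) - combine(b, a)) > tol:
            return False
        # Adding a magnitude greater than zero must give something greater.
        if b > 0 and not combine(a, b) > a:
            return False
    for a, b, c in product(magnitudes, repeat=3):
        # The result must not depend on the grouping of the terms either.
        left = combine(combine(a, b), c)
        right = combine(a, combine(b, c))
        if abs(left - right) > tol:
            return False
    return True


sample = [0.5, 1.0, 2.0, 3.5]  # invented magnitudes

print(meets_addition_criteria(lambda a, b: a + b, sample))        # True
print(meets_addition_criteria(lambda a, b: (a + b) / 2, sample))  # False
```

The second candidate, a kind of averaging, fails because combining 1.0 with 0.5 gives less than 1.0. The point of the sketch is that a candidate operation passes or fails by trial, and passing on a finite sample is evidence, not proof.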

There remains one further possibility 
to be discussed. That is the possibility 
of additivity in the equal appearing in- 
tervals experiment when A is not O. It 
is now obvious that the observer cannot
halve or bisect the sense distances with
any meaning unless he adopts an additive
process. He can, of course, equate the
distances but as has been seen this does
not of itself give an additive scale. If the
observer were to adopt an additive
operation in the equal appearing intervals
experiment, he would have to restrict
himself in the following manner, “If B
is to bisect the distance AC, it would 
mean that the absolute difference AB 
added to the absolute magnitude of B 
should equal C.” This is obviously an 
elaborate and confusing instruction. The 
fractionation technique offers a simpler 
way of obtaining the same results. Fur- 
thermore we find the word “differences” 
creeping into the instruction and it was 
seen that additivity must be shown be- 
fore the concept of difference can have 
any meaning. 

Stevens and Volkmann (42) compared 
the results obtained by the fractionation 
method and the method of equal appear- 
ing intervals and found that the two 
methods yielded essentially the same 
scale. It is, of course, not known whether
the observers in the Stevens and Volkmann
experiment used the equality
definition of 1/2 or the additive definition.
One suspects that they used the equality
definition, as this is the easier for the
observer. If they did, it is not surprising
that the two scales checked each other, as
they were constructed by the same series
of operations. It is true that the two
methods might yield different results if
the two sets of instructions operated
differentially in producing either
complicated or erroneous self-instructions or
constant errors depending on the presence
or absence of a zero stimulus.

If the analysis of the operations in- 
volved in the equal appearing intervals 
experiment and the fractionation experi- 
ment is correct, the lack of reliability of 
fractionation to 14 or 1/10 would be 
easily explained. If the observer is given 
a sense distance A ————— N which he 
is to divide into 5 equal appearing inter- 
vals, he can perform this task with some 
ease. If he is asked to fractionate a
magnitude to 1/5, the task is difficult and
the results do not check the results
obtained by the other methods. In the
equal appearing interval task the ob- 
server must equate 5 intervals all of 
which are given, i.e., the stimuli are un- 
der his control and he can compare every 
interval with every other one directly. 
In the fractionation to 1/5 this is not 
possible. If the above analysis is correct, 
the observer must either find that stimu- 
lus which when added to itself 4 times 
would give N; or, he must adjust B so 
that the interval AB is equal to 4 other 
intervals between A and N, none of 
which are given! One should certainly 
not expect very accurate or reliable re- 
sults from such a procedure. 

Objection b raised by Campbell is not
really important. If the fractionation
scale is not valid because additivity has 
not been demonstrated, the objection is 
superfluous. If additivity has been dem- 
onstrated, and the scale constructed by 
fractionation, there is no reason why the 
two scales should give the same results. 
In fact from Campbell’s discussion it 
might be assumed that they would not. 
Campbell says that properties that have 
the same order are the same magnitude 
or are magnitudes of the same kind (6). 
However when more than order has been 
demonstrated, in other words when an 
additive scale has been constructed, this 
statement may be limited. Campbell says,
“. . . the identity of magnitudes (or at
least of such magnitudes as are susceptible
to fundamental measurement) arises
from similarity of addition, and this
suggestion is correct; magnitudes are the
same, however greatly they may differ in
the relation >, if an operation + can
be found which satisfies the conditions
of addition for all of them” (6).


4) Statistical Methods of Scaling

The author does not intend to go into
any great detail in discussing the
objections raised against the statistical scaling
methods. An excellent critique may be
found in Smith (36).* There are, however,
one or two important points that
might be made.

It is possible to define such a magnitude
as difficulty of mental test items
in terms of the percentage of persons
solving the items. Two items may be
defined as equal in respect of difficulty if
they are passed by the same percentage
of people. An item may be defined as
more difficult than another item if it is
solved by a smaller percentage of people.
Likewise it may be defined as less
difficult if it is passed by a greater
percentage of people. By the use of these
definitions and by performing the necessary
operations upon a group of items, the
items may be arranged in an order of
difficulty. The experimenter is in
possession of an ordinal scale. He has not
established equal units nor has he
demonstrated additivity.
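
The three definitions just given are enough to arrange items in order and no more. A minimal sketch follows, with invented item names and percentages; the function name `order_by_difficulty` is likewise hypothetical.

```python
def order_by_difficulty(percent_passing):
    """Rank items from least to most difficult; equal percentages share a rank."""
    # A smaller percentage passing defines a more difficult item, so the
    # items are arranged by descending percentage.
    ranked = sorted(percent_passing.items(), key=lambda kv: -kv[1])
    ranks, rank, previous = {}, 0, None
    for item, pct in ranked:
        if pct != previous:
            rank += 1
            previous = pct
        ranks[item] = rank
    return ranks


# Hypothetical items and percentages passing, invented for the example.
items = {"item_a": 90, "item_b": 60, "item_c": 60, "item_d": 35}

print(order_by_difficulty(items))
# {'item_a': 1, 'item_b': 2, 'item_c': 2, 'item_d': 3}
```

The ranks 1, 2, 3 record order only. Nothing in the procedure licenses the statement that the step from rank 1 to rank 2 equals the step from rank 2 to rank 3.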

Psychologists have been very anxious 
to demonstrate that they could obtain 
equal units of difficulty for measuring 
mental test items. They thought that the
demonstration of equality of units al- 
lowed them to add and subtract the nu- 
merals representing the items. This, as 
has been shown, is not true. Not only is 
it necessary to demonstrate the equality 
of the units, but it is also necessary to 
demonstrate that the magnitude is 
susceptible to addition. 

Several methods of manipulating test
results have been devised. One common
method consists of translating the
percentages of the attained distribution into
units of the base line of the normal
curve. These base line units are
considered equal. It is true, of course, that
the units of the base line are geometrically
equal. But still it has not been
demonstrated that these units that are
geometrically equal correspond to equal
units of difficulty. One unit distance
along the base line is equal to any other
unit distance. This does not mean that
the distance in difficulty between the test
items that these units represent is also
equal. This relation is assumed. There
is no experimental operation
demonstrating this equality. Experimentation
may demonstrate a relation but
mathematical manipulation cannot create one.

* See also Peatman (39).
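
The translation of percentages into base line units is a purely mathematical step, as a short sketch makes plain. It uses the inverse of the normal distribution function from Python's standard library. The sign convention (harder items receive larger values) and the sample percentages are assumptions of the sketch.

```python
from statistics import NormalDist


def baseline_units(percent_passing):
    """Translate a percentage passing into sigma units of the normal base line."""
    # An item passed by few people lies far above the mean in difficulty,
    # so the proportion failing is taken as the area to the left of the item.
    proportion_failing = 1 - percent_passing / 100
    return NormalDist().inv_cdf(proportion_failing)


for pct in (84.13, 50, 15.87):
    print(pct, round(baseline_units(pct), 2))
```

The three percentages chosen fall about one sigma below the mean, at the mean, and about one sigma above it. The computation nowhere consults any operation performed on difficulty itself, which is the point of the paragraph above.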

Smith gives a very useful example, 
“Let us consider a normal distribution 
of people, with respect to height. It is 
true that equal units of the base line of 
the curve will mark off equal units of 
this quality. For example, if the mean 
height is 60 inches, and sigma 1 is 5 
inches, then it is true that the individuals 
whose frequencies place them at sigma 1 
will be 65 or 55 inches high, depending 
upon whether the sigma is plus or minus. 
It is also true that the frequencies may 
be manipulated to obtain units of height, 
just as they are used to obtain units of 
learning. But quantitative units of height 
are not established by this procedure. 
Rather, sigma 1 has a quantitative mean- 
ing because height has been shown in 
quite another context to be additive. 
Apart from the fact that units of length 
had already been ascertained on opera- 
tional grounds, there would be no reason 
to assert that the sigma marked off a 
unit of height in a fundamental and ad- 
ditive sense. It is the fact that height has 
been measured fundamentally, and in- 
dependently of its career in a normal 
distribution, that gives quantitative 
meaning to a segment of the base line” 
(36). 

The psychologist who is attempting to
measure difficulty is placed in a position
similar to that of the physicist who
wishes to measure temperature. It will
be remembered that the physicist
postulates a relation between equal
increments of an A-magnitude and
temperature. When the physicist has done this
“the laws relating other physical
variables with temperature as so defined
become open to empirical investigation”
(13).

The psychologist has just as much
right to postulate a relation between
difficulty and σ units or, for that matter,
he may postulate a relation between
difficulty and percentage passing. There is
no more reason to postulate the relation
that equal units of the base line
represent equal units of difficulty, than there
is to postulate the relation that equal
units of percentage represent equal units
of difficulty.

If the psychologist postulated a
relation between percentages and difficulty,
there seems to be little doubt that he
would remember that this was not a
demonstrated relation but a postulated
one. Certainly no psychologist would
contend that it has been demonstrated
that the difference in difficulty between
two test items passed respectively by 80
and 90 per cent of the population is the
same increment of difficulty that exists
between two items passed by 50 and 60
per cent of the population. But he may
certainly postulate this relation. If he
does, he may then find the laws relating
other variables to difficulty so defined.
When the psychologist uses σ units, and
has thereby gone one more step from his
original data, it seems more difficult for
him to realize that the relation is still
a postulated one and not a demonstrated
one. But, again, by postulating this
relation he can provide himself with a very
useful tool. He will not get into
difficulty until, either in his theorizing or his
practice, he forgets that the relation is
not demonstrated, but postulated. When
he adds these σ units he is adding equal
distances along the base line of the
normal curve and when he subtracts these
units he is subtracting units of equal
distance along the base line of the normal
curve; he is not adding units of difficulty
nor is he subtracting units of difficulty.
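
That the two postulates cannot both hold is easily shown by applying each to the pairs of percentages mentioned above. In this sketch the names `percent_scale` and `sigma_position` are hypothetical labels for the two postulated scales, and the sign convention (harder items receive larger values) is assumed.

```python
from statistics import NormalDist


def sigma_position(percent_passing):
    # Postulate 1: difficulty is the position on the base line of the
    # normal curve corresponding to the proportion failing the item.
    return NormalDist().inv_cdf(1 - percent_passing / 100)


def percent_scale(percent_passing):
    # Postulate 2: difficulty is simply the percentage failing the item.
    return 100 - percent_passing


def increment(pct_easier, pct_harder, scale):
    """Increment of difficulty between two items on the given scale."""
    return scale(pct_harder) - scale(pct_easier)


for easier, harder in ((90, 80), (60, 50)):
    print(easier, harder,
          increment(easier, harder, percent_scale),
          round(increment(easier, harder, sigma_position), 3))
```

On the percentage postulate the two increments are equal, 10 in each case. On the sigma postulate the first is about 0.44 and the second about 0.25. Neither result is a finding about difficulty; each merely follows from the relation postulated.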

Scales so constructed may be of
immense value, particularly in “applied”
work. When they are used as research
instruments, or when they are used as
the basis for theorizing, they may well
lead to faulty conclusions. That is, they
may do this if the person interpreting the
results forgets that he is dealing with a
postulated relation. If difficulty could be
measured fundamentally, it would be
possible, even probable, that the relation
between difficulty, so defined, and other
variables would be very different from
the relations between σ unit difficulty
and these same variables.

A similar situation exists in the field 
of learning. At the present time it is 
considered impossible to measure learn- 
ing directly. It is considered to be a B- 
magnitude, measured in terms of other 
fundamental magnitudes, such as time. 
It is a strong temptation to regard learn- 
ing as something which exists apart from 
the operations used to measure it. If 
the psychologist adopts this attitude, he 
is surprised when he discovers that dif- 
ferent measures of learning do not always 
give the same results. He forgets that the 
operations that he uses create the magni- 
tude, and there is no reason to assume 
in advance that the magnitudes created 
by different operations should be the 
same. 

Thus the “strength” of a reflex may 
be defined in terms of response latency, 
magnitude of response and response rate. 
Then, if these measures do not
correspond perfectly, the psychologist may be
puzzled because he believes there ought
to be perfect correspondence, as all
measure the same thing, viz. “reflex strength.”
The fallacy is obvious from the preceding
discussion. There may be no unitary
magnitude “reflex strength” defined in
terms of response latency, magnitude of
response and response rate but simply
three independent B-magnitudes, each
one of which varies in its own
characteristic fashion when it is the dependent
variable in a given experiment.
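
Whether three such measures correspond is itself an empirical question, and the check is simple to state. The sketch below computes the product-moment correlation for each pair of measures. The records are wholly invented for the purpose and are not data from any experiment; the names `measures` and `pearson_r` are likewise hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equally long series."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical records of one reflex over six conditions (invented numbers).
measures = {
    "latency_s": [0.42, 0.35, 0.31, 0.30, 0.26, 0.27],
    "magnitude": [1.1, 1.9, 1.7, 2.6, 2.4, 3.0],
    "rate_per_min": [4.0, 6.5, 6.0, 9.0, 7.5, 8.0],
}

for (name_a, a), (name_b, b) in combinations(measures.items(), 2):
    print(f"{name_a} vs {name_b}: r = {pearson_r(a, b):+.2f}")
```

Correlations short of unity are no embarrassment on the view taken here. They would be expected if the three operations define three magnitudes and not one.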


SECTION F. CRITICISMS OF PSYCHOLOGICAL
MEASUREMENT BY
PSYCHOLOGISTS


The plan adopted for dealing with the 
physicists’ objections will be used in this 
section. First the theoretical position 
taken by some psychologists will be out- 
lined and then some of the specific criti- 
cisms against the experimental methods 
will be discussed. As was obvious in the 
previous section, this division is
arbitrary. In reality, theory cannot be
divorced from practice.

No attempt will be made to present 
all of the criticisms of all psychologists. 
Obviously this would be an enormous 
and useless task. In fact, only a selected 
group of criticisms will be offered. These
will be limited to criticisms that are di- 
rectly pertinent to the points raised by 
the logicians. Much of the writing of the 
psychologists has been repetitive, so that 
only representative arguments will be 
brought forward. 

McGregor* (31) in 1935 wrote a
critical study of the application of the
criteria for measurement to psychological
phenomena, but Cohen and Nagel seem
to have made the first important reference
to the failure of psychological
measurement to meet the necessary
criteria. McGregor, basing his arguments on
Cohen and Nagel, Bridgman and Campbell,
reviews the criteria already discussed
and applies them to psychological
measurement.

* Working with E. G. Boring.

His main conclusion is that there is 
no real difference between measurement 
in physics and in psychology. Operation- 
ally they are the same, as both depend 
ultimately upon the discrimination of 
difference. In this respect, he holds that 
from the operational point of view, the 
judgment of equality is “defined nega- 
tively in terms of inability to discrimi- 
nate” (31). This is similar to the view 
expressed previously in this study. It 
was held that the equal judgment is in 
reality a judgment of no difference. Mc- 
Gregor’s definition is, perhaps, a more 
accurate one, as it is true that the judg- 
ment of no difference is based upon the 
inability to discriminate a difference. 
However there is no contradiction be- 
tween the two views. 

McGregor’s argument for the
operational identity of the two types of
measurement is sound enough if one goes no
further than the basic operation of a
“discrimination of difference.” However,
as has been seen, there are other
operations in measurement. One difference
between the two types was noted in the
introduction.

The second point that McGregor 
makes is that such magnitudes as bril- 
liance, chromatic saturation, loudness, 
weight, pressure, sweetness and pain are 
measurable in the ordinal sense. This, 
of course, the physicists will admit. 

The third point that he makes is that
equal sense distances are additive. First,
he contends, it is necessary to obtain a
series of equal sense distances. Then,
“Demonstration of the first law* is easy,
but demonstration of the second law†
presents methodological difficulties,
particularly when the ‘sense distances’
are not coterminous. Nevertheless,
through the use of a method of
substitution similar to that employed by the
physicist in establishing a standard series
of weights, we can demonstrate that the
sum of a series of ‘sense distances’ is
independent of the order of their
addition.‡ This method of substitution
enables us to define§ the operation of
addition much as it is defined for length
in case I” (31). It is certainly not clear
from this exactly what the definition of
addition is.

* I.e., A + B > A′ when A = A′ and B > O.

H. M. Johnson (28) has published a
criticism of “pseudomathematics” in
psychology. He reviewed the logical
requirements for measurement and discussed the
measurement of brightness,‖ hue and
attitude in the light of these criteria. The
criticisms leveled against the attempts
at measurement in psychology are
extremely pertinent with one very important
exception. His discussion of the
additivity of brilliance brings up an
extremely important point. To quote him,
“Perceptible brightnesses are not identical
with what the physicist calls the
brightness or luminosities of surfaces. . . .
It is imperfectly correlated with perceptible
brightness within certain limits, but
it is not identical with the latter.

“Consider a surface illuminated by two
sources S₁ and S₂ in succession. When S₁
is used alone, the observer perceives a
brightness of the surface which we may
call B₁; when S₂ is used alone, he
perceives a brightness B₂ on the same
surface. Using the method of flicker
photometry, or else the method of direct
comparison, let us balance B₁ against a
comparison-field B′₁, and also balance B₂
against another comparison-field B′₂.
Now, expose the surface to both sources
S₁ and S₂ at once. If we agree that in
so doing, we add B₁ and B₂, then by the
axiom of equals B₁ + B₂ = B′₁ + B′₂,
and B₁ + B′₂ = B′₁ + B₂. But this is not
generally true. Suppose, for example,
that the sources which produced B₁ and
B₂ respectively emitted only lithium light
(λ = 671), while the sources that
produced B′₁ and B′₂ respectively emitted
only thallium light (λ = 535). Suppose,
moreover, that B₁ = B′₁ is high, while
B₂ = B′₂ is low. Then B₁ + B′₂ is the
sum of a bright red and a dim olive
green, while B′₁ + B₂ is the sum of a
bright olive green and a dim red.
Although observation yields the separate
equations B₁ = B′₁, B₂ = B′₂, it is very
likely to yield B₁ + B₂ ≠ B′₁ + B′₂. The
operations may not satisfy or even
approximate the axiom of equals” (28).

† I.e., IX, X, XI.

‡ The sentence “The sum of a series . . . of
the order of their addition,” is a verbal
statement of the second law (IX, X, XI).

§ Addition in the case of lengths was defined
as the placing of the systems end to end.

‖ I.e., brilliance.

The first important fact to notice in 
Johnson’s discussion is his clear recogni- 
tion that the magnitude being scaled is 
a subjective magnitude. It is not the 
stimulus correlate, the physical magni- 
tude, which is being measured. 

The next important point to note is
that which is contained in the sentence,
“If we agree that in so doing we add B₁
and B₂, then by the axiom of equals
B₁ + B₂ = B′₁ + B′₂. . . .” But why
should one agree that the proposed
combination of stimuli is an operation for
the addition of the subjective phenomena?
More generally it might be asked,
why should one agree that any proposed
method of combination should be accepted
as the operation for the addition
of any magnitude? As has been seen, it
is impossible to tell before experimentation
whether any proposed operation
will meet the criteria for addition. It
was also pointed out that this does not
mean that the experimenter cannot make
use of his past experience in choosing
an operation which he feels has some 
chance of success. This is most probably 
what Johnson has done. On the basis 
of past experience with physical magni- 
tudes it seems reasonable that this opera- 
tion will add the magnitudes. In short, it 
meets the criteria of physical combina- 
tion or physical juxtaposition. But it has 
also been shown that the criterion of 
physical juxtaposition has arisen because 
it is appropriate to physical measure- 
ment. Is there any reason to assume that 
the operation for addition of a psychological
magnitude will be the same as
that for the addition of a physical mag- 
nitude, especially when, as Johnson says, 
the physical magnitude is imperfectly 
correlated with the psychological? It 
might seem that the very existence of the 
imperfect correlation might lead one to 
expect that the operations would be dif- 
ferent. 

The operation for adding a subjective 
magnitude will be that operation that 
satisfies the necessary criteria for the sub- 
jective magnitude. This does not mean 
that the operations will not or cannot be the
same as those for the physical magnitude. 
It means that they cannot be the same 
when the relation between the physical 
and the subjective magnitudes is not 
linear. 

It might be well to ask here how it can 
be known that the correlation between 
the physical magnitude and the subjec- 
tive magnitude is not linear unless the 
subjective magnitude has first been meas- 
ured. Actually it is not necessary to 
measure a subjective magnitude fundamentally
in order to answer this question.
All that it is necessary to do is to
show that equal increments of physical
magnitude do not correspond to equal
sense distances. For example, if tones of
100 and 200 cycles are presented to an
observer and he reports that the sense
distance between 0 and 100 cycles is not
equal to the sense distance between 200
and 400 cycles, the experimenter is justified
in concluding that the two magnitudes
are not linearly related.
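
The check just described can be sketched in code. The following is only a minimal illustration under stated assumptions: the function name and the two stand-in scales (linear_scale, log_scale) are hypothetical, and in an actual experiment the sense distances would come from an observer's judgments rather than from a formula.

```python
import math

def equal_steps_feel_equal(sense_scale, start, step, count, tol=1e-9):
    """Return True if `count` successive physical steps of size `step`
    all span the same sense distance on the (hypothetical) `sense_scale`."""
    distances = [
        sense_scale(start + (i + 1) * step) - sense_scale(start + i * step)
        for i in range(count)
    ]
    return all(abs(d - distances[0]) <= tol for d in distances)

# Hypothetical stand-ins for an observer's scale, used only for illustration.
def linear_scale(cycles):
    return 0.5 * cycles

def log_scale(cycles):
    return math.log(cycles)

print(equal_steps_feel_equal(linear_scale, 100, 100, 3))  # True
print(equal_steps_feel_equal(log_scale, 100, 100, 3))     # False
```

A single unequal pair of sense distances is enough to rule out a linear relation; no fundamental measurement of the subjective magnitude is required for that conclusion.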

It is necessary to go into this most 
important matter further. To give the 
example presented by Stevens (40): sup- 
pose that the observer was given a tone 
of 40 cycles and asked to find a tone 
equal to this in pitch. It is obvious that 
the observer would select another tone 
of 40 cycles (or very close to it). The 
experimenter now adds these two tones 
and presents a tone of 80 cycles and the 
observer is asked to find a tone equal 
to it in pitch. He will select a tone of 
80 cycles. The experimenter now adds 
these tones and presents a tone of 160 
cycles and asks the observer to find a 
tone equal to this tone in pitch, etc. Has 
the experimenter constructed a scale of 
pitch? It can be argued that it is a scale 
of pitch as the observer did not judge 
the physical correlate, the cycles per sec- 
ond, he judged the discriminable char- 
acteristic, pitch. 

But as Stevens says, “. . . although at 
the outset we could conceivably choose
any one of several sets of operations as 
defining the scale, that set will ultimately 
prove to be most satisfactory for a sub- 
jective scale when it leads to scale num- 
bers bearing a reasonable relationship 
to the experience of the observer” (38). 
In other words, in the example used 
above, the numeral 1 might be assigned 
to the pitch of a tone of 40 cycles and 
the numeral 2 to the pitch of the tone 
of 80 cycles. The question is, do these 
numerals represent the subjective magnitude
of pitch? Is the pitch of a tone of
80 cycles subjectively twice that of the
pitch of 40 cycles; or, to put it more accurately
in the terms of the operations
which would be performed, is the pitch
distance from 0 to 40 cycles equal to the
pitch distance from 40 to 80 cycles? If it 
is not, the constructed scale does not cor- 
respond to the subjective magnitude of 
pitch. 

It will be remembered that Campbell
says that the operation of addition determines
whether extensive magnitudes are
the same. It might seem then that the
operation of addition would determine
whether the magnitude is subjective or
physical. But the operation for addition
will only distinguish between the two
when the subjective magnitude is not
linearly related to the physical magnitude.
For example, if [line segment]
is a line B, and [line segment]
is a line B′, so that B = B′,
and these two are added physically to
produce another line C [line segment],
is the resulting magnitude physical or
subjective? By the criteria of identity of
the additive operations it is necessary to
declare that this is a physical magnitude.











But now suppose that zero physical
magnitude is given the observer together
with the physical magnitudes B and C, in
this fashion:

[Figure: a line marked at the points A (0), B and C]

and the observer is asked to judge whether the
intervals AB = BC. The answer most
probably will be yes. By this criterion,
then, the magnitude is not only a physical
magnitude but also a subjective magnitude.
Furthermore, if the lines were
presented in this fashion,

[Figure: the two lines (B) and (B′) shown separately]

and the observer was asked to add them
subjectively and reproduce that line
which was the sum of B and B′, it is
highly probable that he would produce
the same line that was obtained by the
method of physical addition.

In short, the physical and subjective 
methods of addition would probably give 
the same results. 

In other words, it seems that if the 
relation between the physical and sub- 
jective magnitudes is perfectly linear, the 
operation of addition proper for one will 
also be proper for the other. If they are 
not so related, it will be necessary to find 
some other operation for addition in 
order to measure a subjective magnitude 
fundamentally. However it must be 
noted that linearity over a very wide 
range can never be expected. 
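
The point can be illustrated with a short sketch. It is an illustration under assumptions, not a result from the experiments: the function name and both stand-in scales are hypothetical, and the sketch takes "linear" in the strict sense of proportionality (a straight line through zero).

```python
import math

def physical_addition_adds_sensations(sense_scale, pairs, tol=1e-9):
    """True if, for every pair (b, b2), the magnitude of the physically
    combined stimulus equals the sum of the two separate magnitudes."""
    return all(
        abs(sense_scale(b + b2) - (sense_scale(b) + sense_scale(b2))) <= tol
        for b, b2 in pairs
    )

# Hypothetical scales relating a physical length to its subjective magnitude.
def proportional_scale(length):
    return 2.0 * length

def square_root_scale(length):
    return math.sqrt(length)

pairs = [(1.0, 1.0), (2.0, 3.0), (4.0, 4.0)]
print(physical_addition_adds_sensations(proportional_scale, pairs))  # True
print(physical_addition_adds_sensations(square_root_scale, pairs))   # False
```

Under the proportional scale, physical juxtaposition serves as the operation of addition for the subjective magnitude as well. Under the curved scale it does not, and some other operation would have to be found.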

It is, of course, impossible to say in 
advance what these psychological opera- 
tions for addition will be. One thing that 
seems certain is that they will not include 
physical juxtaposition except in the case 
where the subjective and physical mag- 
nitudes are linearly related. The experi- 
menter will not add the physical corre- 
lates; the observer will add the two sub- 
jective magnitudes. They will be subjec- 
tive in the sense that the equation of 
equal appearing intervals is subjective. 
The observer may be expected to make a 
judgment similar to that mentioned in 
Section E, 3, of an observer in one of 
the experiments reported below. He had, 
it will be remembered, two criteria for 
a ½ judgment; first, the equation of the
intervals and, second, the check on this 
judgment by asking himself, “If I added 
B to B’ would they equal C?” 

It is obvious that this sort of judgment 
will be rather difficult for many observ- 
ers. If the experimenter is fortunate 
enough to find that the magnitude with 
which he is dealing is additive, and if
he also finds that the results obtained
with the additive method correlate perfectly
with the results obtained by the
method of equal appearing intervals, he
will, of course, be at liberty to abandon
the more difficult method in favor of the
easier. However, it will always be necessary
to show additivity and perfect correlation
between the two methods before
this is done.

As the method of equal appearing intervals
(including the special case,
fractionation) is at the present time the
only method for obtaining subjectively
equal units, it would be well to review
some of the important criticisms lodged
against it.

For example, Guilford says: “But the 
reader must be reminded again of the 
still doubtful status of equated psycho- 
logical intervals. There is the finding of 
Hevner that intervals among stronger 
stimuli are underestimated as compared 
with those among the weaker stimuli. 
Much earlier, Ament had found that, in 
a bisected interval, the higher of the two 
segments contained fewer j.n.d.’s than 
the lower. Whether the judgment of 
supraliminal differences can ever be 
brought into line with the judgments of 
liminal differences is hard to say” (21). 
Why should the two methods, based on 
different operations, yield the same re- 
sults? Furthermore Guilford is assuming 
that the true scale for measuring a psy- 
chological magnitude is the j.n.d. scale 
so that the scale constructed from the 
method of equal appearing intervals 
must agree with it or be discarded. As we 
have seen, the method of equal appearing 
intervals is designed to give equal units. 
The operations used are those for obtaining
equality, i.e., the judgment of “difference”
and “no difference” between
the intervals concerned. In the method
of just noticeable differences the equality
of the j.n.d.’s is an assumption. There is
no judgment of “difference” and “no difference”
between two different j.n.d.’s.
There is nothing in the operations for
obtaining the j.n.d.’s that allows one to
interpret them as equal. Without such
an operational basis the statement of
equality is meaningless. It may be true
that they are equal, but this must be
established by comparing the j.n.d. scale
to a scale constructed upon a set of
operations designed to give equal units.

Hevner (26) has found that the method 
of paired comparisons and the order of 
merit method gave the same results when 
used to measure goodness of handwrit- 
ing. What she calls “the method of equal 
appearing intervals” did not give results 
comparable to the other two methods. 
Should this be surprising? As has been 
seen, there is no reason to be surprised 
if the use of different operations gives 
different results. In fact it is surprising 
if they give the same results. 
Furthermore, Hevner did not actually 
use the method of equal appearing in- 
tervals. After all, the very essence of the 
method consists in making the intervals 
equal and Hevner did not instruct her 
subjects to make the intervals equal. Her 
instructions seem to have been copied 
from Thurstone and Chave (48), who 
also seem to have omitted the instruction 
to make the intervals equal. These ex- 
perimenters, using what they call the 
method of equal appearing intervals, 
instructed their observers to arrange a 
group of statements, having to do with 
appreciation of the church, in the fol- 
lowing manner: “You are given eleven 
slips with letters on them, A, B, C, D, 
E, F, G, H, I, J, K. Please arrange these 
before you in regular order. On slip A 
put those statements which you believe 
express the highest appreciation of the 
value of the church. On slip F put those
expressing a neutral position. On slip K
put those which express the strongest 
depreciation of the church. On the rest 
of the slips arrange statements in accord- 
ance with the degree of appreciation or 
depreciation expressed in them. This 
means that when you are through sorting 
you will have eleven piles arranged in 
order of value-estimate from A, the high- 
est, to K, the lowest” (48). There is no 
intimation given the observers that they 
are to make the interval between the 
statements placed in A and B equal to 
the interval between B and C, etc. All 
the instructions tell the subject to do 
is to arrange the samples in a given 
order, starting with A, the highest, and 
ending at K, the lowest, using F as the 
neutral or mid-point. 

The observers may equate the inter- 
vals without being instructed to do so, 
but without specific instructions to the 
contrary there is no guarantee that they 
did not simply arrange the statements in 
a rank order of eleven steps. In fact, since 
that is all they were instructed to do, it 
is even probable that that is all they did 
do. 

It is unfortunate that one must offer 
this criticism of this first important at- 
tempt to measure a discriminable charac- 
teristic which has no known stimulus 
correlate. 

However Thurstone has used what he 
calls the method of equal appearing in- 
tervals under protest. He claims that “the 
ideal unit of measurement for the scale 
of attitudes is the standard deviation of
the dispersion projected on the psychophysical
scale of attitudes by a statement
of opinion, chosen as a standard” (48). 
The logical fallacy of statistical scales 
has already been discussed. 

Hevner takes a position somewhat
similar to that of Thurstone. She says
that there are “several facts that point
to the superiority of the method of
paired comparisons and the order of
merit method over the method of equal
appearing intervals” (26). To take these
up in order:

1) The order of merit method and the
method of paired comparisons give the
same scale. The method of equal appearing
intervals does not check with the
other two.

2) There is no check on internal consistency
in the method of equal appearing
intervals. This, according to Hevner,
is the most important criticism of the
method.

3) In the method of equal appearing
intervals the frequency distributions are
skewed at the end of the scales. The
medians for the ends are not as representative
as the medians at the middle of
the scale.

4) The j.n.d. or discriminal error,
which is the fundamental psychophysical
unit of measurement, is incorporated in
both the order of merit method and the
method of paired comparisons but is not
involved in the method of equal appearing
intervals.

To answer these objections in order:

1) The fact that two methods check
one another does not mean that either
one meets the necessary logical criteria
for measurement. The fact that the
method of equal appearing intervals does
not check with the method of paired
comparisons and the method of rank
order does not mean that the method
of equal appearing intervals is not
acceptable; in fact, from what we have
seen it may, and probably does, mean
that the other two are unacceptable as
measures of subjective magnitude. The
so-called “equality” obtained by the
other methods results from statistical
manipulations. The appropriateness of
a scale must depend on the operations
used in constructing it, not on such
mathematical manipulations.

2) Hevner has said that the method 
of equal appearing intervals lacks a check
of internal consistency. But all of the 
logical criteria for measurement are 
checks of internal consistency. There is 
another check which has been applied 
by Gage to the consistency of loudness 
judgments. The method runs as follows: 

Given the stimulus distance AE: 


A B C D E 


the observer is first instructed to equate 
the distance AC, CE. The observer is 
then presented with the stimulus dis- 
tance AC and equates the distance AB, 
BC; and, likewise, he is given the stimu- 
lus distance CE and he equates CD, DE. 
The observer is then given the distance 
BD and he equates BC, CD. If there are 
no errors operating in the procedure the 
observer should “bisect” the distance BD 
at the original point for the “bisection” 
of AE, i.e., C. Incidentally, Gage found
that his final “bisection” was considerably
higher than the initial “bisection.”
Newman, Stevens and Volkmann (32)
used the same check on the judgment of
loudness and, after making improvements
in Gage’s procedure, came to the
conclusion that Gage’s results were due
to constant errors, all of which could be
greatly minimized.

3) Hevner’s third criticism is obviously 
not a criticism of the method of equal 
appearing intervals as such, but simply of 
her particular application of it. 

4) The last criticism has already been 
answered in the discussion of the j.n.d.
as a measuring device. 
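
The repeated-bisection check described under the second answer above can be sketched as follows. This is a minimal illustration, not the procedure of Gage or of Newman, Stevens and Volkmann: the observer is modeled by a hypothetical logarithmic sense scale, and the multiplicative constant_error is an assumed form of constant error chosen only to show how a bias propagates through the check.

```python
import math

def make_observer(scale, inverse_scale, constant_error=1.0):
    """Build a bisecting observer from a hypothetical sense scale.
    `constant_error` multiplies every setting (1.0 means no bias)."""
    def bisect(low, high):
        midpoint = inverse_scale((scale(low) + scale(high)) / 2.0)
        return midpoint * constant_error
    return bisect

def repeated_bisection_check(a, e, bisect):
    """Bisect AE at C, AC at B, CE at D, then BD; return (C, second C)."""
    c = bisect(a, e)
    b = bisect(a, c)
    d = bisect(c, e)
    c_again = bisect(b, d)
    return c, c_again

unbiased = make_observer(math.log, math.exp)
biased = make_observer(math.log, math.exp, constant_error=1.05)

c, c_again = repeated_bisection_check(1.0, 16.0, unbiased)
print(math.isclose(c, c_again))  # True: the check is passed

c, c_again = repeated_bisection_check(1.0, 16.0, biased)
print(c_again > c)               # True: the final bisection lies higher
```

In this sketch an error-free observer returns to the original point C, while a constant upward error accumulates over the successive bisections, so that the final setting lies above the first.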

Several types of defense have been offered
in support of the psychologists’ position.
One of the commonest may be
summed up by quoting Bartlett and
Craik, who were members of the
above-mentioned Committee of the British
Association for the Advancement of Science.

“If all measurement must conform to 
the Laws of Measurement enunciated by 
Dr. Campbell, and, in particular, if the 
second law can only be satisfied by the 
physical juxtaposition of equal entities, 
then sensation-intensity cannot be meas- 
ured. Yet this standard would have im- 
posed a severe handicap in the early days 
of natural philosophy, and, maybe some 
sciences must still be allowed a greater 
latitude. The alternative would seem 
to be the coining of a new title for much 
that is called measurement. 

“A scale built up by the addition of 
equal standard units is the ideal, but to 
say that measurement is possible only by 
such scales, would seem to be an unhelp- 
ful limitation of the meaning of the 
word” (Bartlett, R. J., 14). 

“The Committee seems to me to have 
been facing two main points: (a) is sen- 
sation intensity measurable? (b) why 
should anyone want to make out that 
sensation intensity is measurable, and 
why should anyone want to measure it? 

“The answer to the first of these ques- 
tions must be sought by finding a defini- 
tion of measurement which fits its use 
in other sciences, and then asking 
whether the facts obtained by psycho- 
logical experiments enable the estima- 
tion of sensation magnitudes to be sub- 
sumed under this definition. It is im- 
portant not to base the definition of 
measurement only on the most stringent 
instance, such as length; for ‘measure- 
ment’ is applied also to scales of temper- 
ature, density, time, etc. which fail to 
fulfill one or other of the conditions 
which are fulfilled by length. Thus, to 
insist that a quantity is measurable only 
if the operation of adding together two 
numerical quantities predicts the result 
of combining such quantities of the given
physical magnitude, would rule out the
temperature scale quite as much as a 
sensation scale” (Craik, 14). 

These statements give the impression
that the whole point of the physicists’ 
objections has been misunderstood. If 
one examines the statement of Guild’s 
quoted on p. 6 it is true that he might 
seem to be forbidding the use of the 
term “measurement” to psychologists. 
But it seems to the author that the argu- 
ment is not really over the use of the 
term, rather it is over the use of the 
meaning of the term as defined by the 
physicists. The logicians and physicists 
have appropriated the word “measurement”
and have defined it in their own
way. The definition is rigid and exact. 
The physicists say that psychologists may 
not use the term with this meaning be- 
cause they have not demonstrated the 
relations necessary to give the term this 
rigid and exact meaning. Suppose the 
physicists and psychologists redefined 
the term “measurement” along the lines 
suggested by Craik above. They might 
arrive at some such definition as Scates 
(34, 35) seems to imply, namely, that
measurement is anything that any intelli- 
gent scientist has ever called measure- 
ment. Then it would be necessary for the 
psychologists to use a new term to indi- 
cate what the physicists now call “meas- 
urement” or “fundamental measure- 
ment.” Suppose they chose the word 
“scale.” It is certain that the physicists 
would not now object to the use of the 
word “measurement” by the psycholo- 
gists but would object strongly if they 
used the word “scale.” 

The author of this study suggests that
Campbell’s definition of measurement be
adopted: the assignment of numerals to
systems according to scientific laws. This
definition would include all of the kinds
of measurement discussed in this paper:
the ordinal scale, Stevens’ “intensive
scale,” the equal unit scale and both the
A- and B-magnitudes. In order that this
use of the term will not lead to confusion
it is only necessary to prefix the
correct adjective when one wants to
speak of one of the specific kinds of
measurement contained in the definition, such
as “ordinal measurement,” “equal unit
measurement,” etc.

When Bartlett says that the rigid 
standard imposed by the physicists would 
have been a severe handicap in the early 
days of physical science, it is difficult to 
see what he means. Certainly it is not
a handicap to be unable to use the term 
measurement. The only handicap that 
the physicists could have suffered would 
have arisen from the inability to perform 
the necessary operations to demonstrate 
the required relations. But that does not 
mean that the early physicists did not 
continue to use what tools were at their 
disposal until better ones could be de- 
vised. 

Likewise the physicists do not demand 
that psychologists immediately cease 
demonstrating those relations which they 
are able to demonstrate; but they do 
point out that the psychologists should 
not interpret their data on the basis of 
undemonstrated relations. 


SECTION G. ZERO SUBJECTIVE 
MAGNITUDES 

As was seen in the last part of Section
B, a zero magnitude is associated with
the proposition, if

A + B = A′ when A = A′,

B has the magnitude of zero.

Let A stand for the subjective weight
correlated with a physical weight of 100
gr. and B stand for the subjective weight
correlated with a stimulus increment of
3 gr. If the two physical magnitudes are
added, the resulting psychological magnitude
A + B = A′ when A = A′.

But if A is associated with a physical
magnitude of 50 gr. and B still with one
of 3 gr., the addition of the physical
magnitudes will yield a psychological
magnitude A + B that is greater than A′.
Thus B is zero in one case and in the
other case it is not zero.

Is this situation different from the situation
existing in the realm of physical
measurement? If the physicist, in weighing
5,000 lbs., adds 1 gr., then

5,000 lbs. + 1 gr. = 5,000 lbs.,

but if he is weighing 2 gr. on a sensitive
scale and adds 1 gr., then

2 gr. + 1 gr. > 2 gr.

In other words, zero is dependent upon
the operations used for measurement. In
this case the operations involve the use
of scales of different sensitivity. If this is
true, what then is the absolute zero? It
would seem to be associated with a
different system if one uses scales of different
sensitivity. It seems logical that
the absolute zero will be the magnitude
of that system that fulfills

A + B = A′ (when A = A′)

and the rest of the criteria for equality,
on the most sensitive measuring device
that has been devised.
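
The dependence of zero on the measuring operation can be made concrete with a small sketch. Everything numerical here is an assumption made for illustration: the resolutions of the two weighing scales, the observer's proportional threshold of 0.04, and the function names are all hypothetical.

```python
def reads_as_zero(base, increment, least_detectable):
    """True if adding `increment` to `base` produces no detectable change,
    i.e. the increment behaves as a zero magnitude for this operation."""
    return increment < least_detectable(base)

def fixed_resolution(resolution):
    """An instrument that registers only changes of at least `resolution`."""
    return lambda base: resolution

def proportional_threshold(fraction):
    """An observer whose least detectable change grows with the base."""
    return lambda base: fraction * base

# All numbers below are hypothetical, in grams.
coarse_scale = fixed_resolution(500.0)   # a scale suited to weighing tons
fine_scale = fixed_resolution(0.01)      # a sensitive laboratory scale
observer = proportional_threshold(0.04)  # assumed lifted-weight observer

print(reads_as_zero(2_268_000.0, 1.0, coarse_scale))  # True  (about 5,000 lbs. + 1 gr.)
print(reads_as_zero(2.0, 1.0, fine_scale))            # False (2 gr. + 1 gr.)
print(reads_as_zero(100.0, 3.0, observer))            # True  (100 gr. + 3 gr.)
print(reads_as_zero(50.0, 3.0, observer))             # False (50 gr. + 3 gr.)
```

The same increment is a zero magnitude under one operation and a non-zero magnitude under another, for the physicist's scales and for the observer alike.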

In psychology the absolute threshold 
will be the magnitude that will fulfill 
the above conditions for the whole range 
of any given subjective magnitude; in 
other words, the absolute threshold 
would seem to be the logical zero magni- 
tude for all discriminable characteristics. 


SECTION H. MEASUREMENT WITHOUT
PHYSICAL CORRELATES

When the physicist measures a fundamental
magnitude the operations he performs
do not depend in any way upon
any other measurable magnitude. If they
did he would not be in possession of an
A-magnitude but of a B-magnitude.
However it is necessary for him to be 
able to identify or reproduce the systems 
that he is measuring in respect of this 
given magnitude. If he is measuring 
weight, he must be able to tell one sys- 
tem from another. This could be done 
by marking the systems, say with letters 
of the alphabet, or by painting each one 
of them with a different color, red, green, 
purple, etc. Also, if he had previously 
measured their volume, he could iden- 
tify them by their volume. But it is im- 
portant to note that the operations he 
performs on the systems do not depend 
upon volume as a measurable magnitude. 
The volume merely serves as a means of 
identification and he could have used 
any other method of identification that 
was convenient. 

Suppose that after he has measured the 
systems in respect of weight he plots 
weight against the identifying marks. 
That is, he plots weight against A, B, C, 
D, E, F, G, H, I, J, K. The plot might 
be something like that in Figure 3. But 
the question must be asked, why should 
the identifying marks on the abscissa be 
ordered in the way they are? Is it not 
equally valid to plot weight against the 
alphabet as arranged along the abscissa in 
Figure 4? The further question might 
be asked, why are the identifying marks 
equally spaced along the abscissa? Could 
not the plot be made as in Figure 5, with 
the alphabet scattered randomly along 
the abscissa? The answer is that both of 
these possibilities are equally valid be- 
cause A, B, etc., have no meaning outside 
their use as identification marks. As iden- 
tification marks they have no order and 
they are not necessarily equally distant 
from each other. The only reason for 
ordering and spacing A, B, etc., along the 
abscissa as they are in Figure 3 arises
from the fact that it has been found
that the letters belong to systems that are
ordered and equally spaced in respect
of weight. In other words, the physicist
could really be plotting weight against
weight. Even though the units along the
base line are letters of the alphabet, they
obtain their relations to each other from
the fact that they now represent systems
that have been scaled with respect to
weight. The plot of weight against identifying
marks is meaningless.

Suppose, however, the physicist had
used volume as his means of identification.
Is the plot of weight against volume
meaningless? There are two answers to
this question. The plot is meaningless if
volume is simply an identification mark,
because if that is all it is there would
be no reason for placing units of volume
at equal distances along the abscissa any
more than there was reason for placing
the letters of the alphabet equally distant
along the abscissa. But the plot is very
meaningful if weight is plotted against
volume qua volume. In this case weight,
as the dependent variable, is plotted
against another magnitude that is susceptible
to fundamental measurement.
The order and the equality of the units
of volume in this case arise from the fact
that volume has previously been measured
fundamentally. In short, the physicist
has plotted two fundamental magnitudes
against each other and is in possession
of a functional relation.

Turning to psychological magnitudes, 
consider the pitch function of Stevens 
and Volkmann, which was constructed 
after the fashion outlined in Section D.
First it is necessary to assume, solely for
the purpose of this discussion, that the
operations used by Stevens and Volkmann
(42) have met all the criteria for
measurement. Then the question may be
asked, does the measurement of this
psychological magnitude depend on the
prior measurement of frequency? If it
does, then pitch is not a fundamental 
magnitude. 

It should be obvious that the case of 
pitch and frequency is analogous to the 
case of weight and volume discussed 
above. In short, Stevens and Volkmann 
actually used frequency as a means of 
identifying or reproducing any given 
system with which a certain pitch is as- 
sociated, just as the physicist might have 
used volume as a means of identifying 
any given system with which a certain 
weight is associated. Stevens and Volk- 
mann could have used any other con- 
venient method of identifying pitch. For 
example, the identification might have 
been made by some arbitrary marks on 
the dial of the reproducing instrument. 
In short, pitch is measured indepen- 
dently of any other magnitude. 

The pitch function happens to be a 
convenient graphic method for the as- 
signment of numerals to different pitch 
magnitudes. The fact that it also shows
the relation of pitch to frequency is incidental
so far as measurement is concerned.
When the magnitude of a number
of identifiable pitches has been determined
and the pitch function is constructed,
the magnitude of intermediate
pitches may be estimated by interpolation.
However, this must not be confused
with measurement. The fact that the
pitch magnitude associated with any frequency
may be read from the pitch function
does not mean that the pitch function
is necessary for measurement. It
simply means that, having measured a
certain number of pitches fundamentally,
the function can be used to give
an estimate, albeit a reliable estimate, 
of the magnitude of other pitches. In the 
same way the weight-volume function 
might be used to estimate the weight 
associated with any given volume. 
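
The distinction between measuring and estimating can be shown in a short sketch. It is only an illustration: the function name is hypothetical, straight-line interpolation is an assumed choice, and the (frequency, pitch) pairs are invented numbers, not values from Stevens and Volkmann.

```python
from bisect import bisect_right

def estimate_by_interpolation(measured, stimulus):
    """Linearly interpolate between measured (stimulus, magnitude) points,
    which must be sorted by stimulus value; no extrapolation is attempted."""
    xs = [x for x, _ in measured]
    if len(measured) < 2 or not xs[0] <= stimulus <= xs[-1]:
        raise ValueError("need two or more points that bracket the stimulus")
    # Index of the measured point just above the stimulus (clamped at the end).
    i = min(bisect_right(xs, stimulus), len(xs) - 1)
    (x0, y0), (x1, y1) = measured[i - 1], measured[i]
    return y0 + (y1 - y0) * (stimulus - x0) / (x1 - x0)

# Hypothetical (frequency, pitch magnitude) pairs, used only for illustration.
measured = [(100.0, 10.0), (200.0, 30.0), (400.0, 40.0)]

print(estimate_by_interpolation(measured, 150.0))  # 20.0
print(estimate_by_interpolation(measured, 300.0))  # 35.0
```

Only the listed points stand for measured magnitudes. Every value returned for an intermediate frequency is an estimate read from the function, which is why the sketch refuses to go beyond the measured range.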

The pitch function may then be said 
to serve a double purpose. In the first 
place, it shows the relation between the 
subjective magnitude pitch and the phy- 
sical magnitude frequency and, in the 
second place, it allows an estimate of the 
magnitude of intermediate pitches. 

It can be stated then that the opera- 
tions for measurement in psychology do 
not necessarily depend upon the prior 
measurement of any other magnitude. 


SECTION J. SUMMARY AND CONCLUSIONS 
FOR PART I 

The author’s conclusions up to this 
point may be summed up as follows: 

1) In order to establish an ordinal
scale it is necessary a) to define >, <
and = by means of a set of operations
used in establishing these relations, and
b) to demonstrate experimentally
that the relations > and < defined
by this set of operations are asymmetrical
and transitive; and that the
relation expressed by the symbol =, defined
by this same set of operations, is
symmetrical and transitive.

In other words, with respect to > and
< the following criteria must be met: if

A > B, then B ≯ A                    I (11)
A > B and B > C, then A > C,         II (11)

and with respect to =, the following
criteria must be met:

A ≯ B and A ≮ B                      III

if

A > C, then B > C                    IV
A < C, then B < C.                   V


2) When the criteria above have been 
met it is then possible to assign numerals
which may represent the relations demonstrated.
Two types of numerals may
be assigned, the conventional ordered 
numeral series, 1, 2, 3, 4, etc., or a series 
without a conventional order. 

If the numerals with the conventional 
order are assigned to represent the sys- 
tems in respect of the ordered magnitude, 
it will be convenient that they be as- 
signed so that they may be interpreted 
conventionally, i.e., so that the conventional
order of the numerals agrees with
the demonstrated order between the sys- 
tems. Since this is so, Campbell’s rule for 
the assignment of numerals to an ordered 
magnitude must be followed. It must be 
borne in mind that the conventional 
order of the numerals adds nothing to
the significance of the demonstrated rela- 
tions. 

If non-conventional numerals are as- 
signed, the numerals will represent the 
order of the systems as in the case above. 
The chief difference is that when non- 
conventional numerals are assigned the 
person who uses the numerals must learn 
a new ordered numeral system, the order 
of which is determined by the order of 
the systems to which the numerals are 
assigned. 

The assignment of non-conventional 
numerals emphasizes the fact that the 
numerals assigned to a group of systems 
do not create the relations between these 
systems. Experimentation demonstrates 
the relations, while numerals are merely 
used to represent previously demonstrat- 
ed relations. 

3) In order to construct an extensive 
scale it is necessary to define ( ) and + 
by a set of operations. 

When this has been done, it is necessary
to meet the following criteria:

    if $A = A'$ and $B > 0$, then $A + B > A'$,            VIII
    if $A + B = X$, then $B + A = X$,                       IX
    if $A = A'$ and $B = B'$, then $A + B = A' + B'$,       X

and

    $(A + B) + C = A' + (B' + C')$.                         XI
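
As a rough illustration, a proposed operation for addition can be tested against these criteria on a small set of systems. The sketch below is hypothetical throughout: the systems are stood in for by positive numbers, `add`, `greater` and `equal` are placeholder operations supplied by the user, and XI is read simply as associativity. Criterion X is omitted because it needs distinct systems that have been shown equal, which numeric stand-ins make trivial.

```python
from itertools import product


def check_additivity(items, add, greater, equal):
    """Check criteria VIII, IX and XI for a proposed addition operation.

    Every item is assumed to be greater than zero, as VIII requires of B.
    """
    pairs = list(product(items, repeat=2))
    triples = list(product(items, repeat=3))
    return {
        # VIII: adding B (> 0) to A gives something greater than A.
        "VIII": all(greater(add(a, b), a) for a, b in pairs),
        # IX: the order of combination does not matter.
        "IX": all(equal(add(a, b), add(b, a)) for a, b in pairs),
        # XI (read here as associativity): the grouping does not matter.
        "XI": all(equal(add(add(a, b), c), add(a, add(b, c)))
                  for a, b, c in triples),
    }


def close(x, y):
    # Placeholder equality with a small tolerance.
    return abs(x - y) < 1e-9


def bigger(x, y):
    # Placeholder "greater than" with the same tolerance.
    return x - y > 1e-9


lengths = [1.0, 2.0, 3.5]  # hypothetical magnitudes

# Ordinary summation satisfies all three criteria.
summed = check_additivity(lengths, lambda a, b: a + b, bigger, close)

# Taking the larger of the two is commutative and associative but fails VIII.
largest = check_additivity(lengths, max, bigger, close)
```

The second case shows why VIII carries most of the weight: an operation can satisfy IX and XI and still not be an addition.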

4) The above criteria are the only 
ones that it is necessary to meet in order 
to demonstrate additivity. It is not neces- 
sary to meet the criterion of physical 
juxtaposition, though physical juxta- 
position may be necessary in order to 
meet VIII, IX, X, XI. 

5) When the above criteria have been 
met numerals may be assigned to repre- 
sent the systems. 

As in the case of order, either a con- 
ventional or a non-conventional numeral 
series may be used. Whether the conven- 
tional or the non-conventional series is 
used, the assigned numerals obtain their 
meaning from the fact that certain rela- 
tions have been demonstrated to exist 
between the systems to which they are 
assigned. 

If non-conventional numerals are used 
and it is necessary to manipulate these 
numerals in order to predict the result 
of the actual manipulation of the sys- 
tems, it would be necessary to invent a 
new arithmetic. 

If, on the other hand, the conventional 
numeral series is used, it may be inter- 
preted in the conventional manner, i.e., 
the numerals may be treated as if they 
represented objective numbers because 
the same relations have been shown to 
hold between the systems that hold be- 
tween objective numbers. In short the 
ordinary, conventional arithmetic may 
be used to predict the results of actual 
manipulation of the systems. If the con- 
ventional numeral series is used and it is 
wished to interpret them conventionally, 
Campbell’s rules for the assignment of 
numerals must be followed. If a non-conventional
numeral series is used,
Campbell’s rules for the assignment of
non-conventional numerals may be followed.

6) The scaling methods based on differential
sensitivity techniques do not
meet the logical requirements for A- 
magnitudes nor the logical requirements 
for equal units. They do yield an ordinal 
scale. 

7) The method of equal appearing 
intervals meets the requirements for an 
ordinal scale. This method also provides 
adequate operations for equating sense 
distances. The demonstration of equality 
of units does not demonstrate additivity. 
For this reason the numerals assigned to 
the magnitudes may not be interpreted
as if they represented objective numbers. 

8) The method of fractionation is a 
special case of the method of equal ap- 
pearing intervals, except in so far as 
the observer gives himself the “additive 
instruction.” 

The half judgment or the bisection 
judgment cannot be operationally defin- 
ed apart from 1) the equation of sense 
distances or 2) the addition of equal 
absolute stimulus intensities. 

If the additive operation is not used, 
the same objections that are raised 
against the method of equal appearing 
intervals may be raised against the meth- 
od of fractionation. 


9) The statistical scaling methods present
adequate operations for obtaining
ordinal scales but not for extensive scales.

B-magnitudes may be constructed by 
the use of these methods. There are obvi- 
ous precautions that must be observed in 
the interpretation of results obtained 
from the use of these methods. 

10) The difference between a physical 
and a psychological magnitude will be 
determined by the operations for addi- 
tion, except in the perhaps non-existent 
case where there is a linear relation be- 
tween the two magnitudes. In those cases 
where the relation between the physical 
and subjective magnitudes is not linear,
it is still possible that an adequate operation
for addition of the psychological
magnitudes may be found. 

11) The operation for addition of the 
physical magnitude would often seem to 
involve physical juxtaposition, whereas 
the operations for the addition of the psy- 
chological magnitude would seem to in- 
volve subjective addition, i.e., a judg- 
ment of + without physical juxtaposi- 
tion. 

12) Psychologists may construct four 
types of scales: 

1) Ordinal scales, i.e., those in which 
the relations of order have been demon- 
strated. 

2) Intensive scales (as defined by 
Stevens), i.e., scales in which adjacent 
numerals are assigned by the adoption 
of some rule.

3) B-magnitudes, i.e., magnitudes that 
are scaled indirectly by some fundamen- 
tal magnitude. There may be a postu- 
lated relation between the increments of 
the magnitude being scaled and the in- 
crements of the A-magnitude by which 
it is scaled. The B-magnitudes and the 
intensive magnitudes are very similar.

4) The scales constructed by the 
method of equal appearing intervals (or 
fractionation). These scales have both a 
demonstrated order and equal units. Additivity
has not been demonstrated. The
author disagrees with Stevens’ contention 
that these magnitudes are examples of 
fundamental measurement. Hereafter in 
this study they will be called equal unit 
scales. 

It is the author’s bias that many psy- 
chological magnitudes will yield to the 
operations of fundamental measurement. 
To support this bias he can cite the fol- 
lowing: 

1) Introspectively the additive judg- 
ment seems possible. 

2) Physical juxtaposition is not one of
the criteria for fundamental measurement.

3) The operations for constructing 
psychological magnitudes are indepen- 
dent of any other measurable magnitude. 

4) It seems probable that perceived 
length (the discriminable characteristic 
that is correlated with the physical mag- 
nitude of length) will prove to be meas- 
urable by both physical and subjective 
operations for addition. If perceived 
length is measurable by a subjective
operation for addition, the author sees 
no reason why other subjective magni- 
tudes may not be measurable. 

5) Campbell has expressed the opinion 
that the physical magnitudes that meet 
VIII (A + B > A’) will prove to be
measurable fundamentally. He bases his 
conclusion on his experience with phy- 
sical magnitudes and claims that he 
knows of only a few exceptions to this 
statement, e.g., intensity of x-rays. 

If VIII is fulfilled, Campbell thinks 
that the fulfillment of the second law 
(i.e., IX, X, XI) will depend upon the
perfection of apparatus and technique.
In other words, the essential criterion for
additivity is A + B > A’, and the rest 
of the criteria will be fulfilled when the 
procedure is so perfected that constant 
errors introduced by apparatus, etc., are 
eliminated. 

Certainly in psychology it is difficult 
to think of a case where the operation of 
subjective addition will not satisfy VIII. 
If an observer is presented with two lines,
A and B,
and asked to reproduce a third that is
equal to the sum of A and B, it is certain 
that the result will fulfill the criterion 
A + B > A’. Likewise if the observer is
presented with two lights and asked to
choose a third light whose brilliance is
equal to the sum of the brilliance of the 
two, it is certain that the brilliance of the 
third light will fulfill VIII. 

When these five statements are con- 
sidered, fundamental measurement does 
not seem too far away. 

It is true that no subjective magnitude 
has been measured fundamentally. The 
belief of the author that they may be so 
measured is an hypothesis. Only experi- 
mentation can give the answer. But it 
seems that the major objections of the 
physicists have been answered. There are 
no a priori reasons why psychological 
magnitudes may not be measured fundamentally.
Measurement in psychology and measurement
in physics are in no sense different.
Physicists can measure when they can 
find the operations by which they may 
meet the necessary criteria; psychologists 
have but to do the same. They need not 
worry about mysterious differences be- 
tween the meaning of measurement in 
the two sciences. 

The author at one time believed that 
the equal appearing interval experiment 
led to fundamental measurement. He 
had not realized that the demonstration 
of equal units does not constitute a 
demonstration of additivity. The three 
experiments reported in this study use 
the method of fractionation. The results 
show that the three types of magnitudes 
here scaled are amenable to the method. 
They give further evidence that a physi- 
cal correlate is not necessary for the 
measurement of psychological magni- 
tudes. Unfortunately additivity has not 
been demonstrated, but equal unit scales 
have been constructed, and the author 
feels that the demonstration of additivity
need be no longer delayed because of 
purely theoretical considerations.


Part III


EXPERIMENTAL: THE SCALING OF VISUAL RATE 


SECTION A. THE PROBLEM 


THE PROBLEM in this experiment was
to determine whether the perceived
rate of the flash of a lamp is fundamentally
measurable. It will be remembered
that the author believed, during the
course of these experiments, that both
the method of fractionation and the
method of equal appearing intervals were
valid operations for the fundamental
measurement of subjective magnitudes.
As he has since changed his opinion, it
might be better to rephrase the problem:
The problem in this experiment is
to determine whether an equal unit scale
can be constructed for visual rate. Furthermore,
if it is possible to construct
an equal unit scale for visual rate, it is
intended to find the relation between
visual rate so scaled and its physical
correlate.

Fig. 6. Wiring diagram of the timer. C, the
variable condenser; NL, neon lamp; SR, sensitive
relay; V, the outlet to the lamp which
served as the variable.


SECTION B. PROCEDURE 
1) Apparatus 


The current from two B-batteries is 
led through a variable condenser (C, Fig. 
6).** When the condenser is fully charged
it discharges through the neon lamp (NL, 
Fig. 6) which is connected in parallel. 
The amount of the condenser’s capaci- 
tance will determine the length of time 
that it takes to charge—the higher the 
capacitance the longer the charging time. 
The rate at which the neon light will 
flash will be controlled, then, by the 
amount of capacitance in the variable 
condenser. A simplified diagram of the 
variable condenser system is given in Fig- 
ure 7. The capacitance in this system 
ranged from 12 mfd to 0.10 mfd. This 
range of capacitance was capable of pro- 
ducing rates from 0.27 flashes per second 
to 12.3 flashes per second. The capaci- 
tance in this system could not be varied 
continuously. The change from 12 mfd to 
0.10 mfd took place in 80 discrete steps. 
The average change in rate for each step
was 0.15 flashes per second, though it was
sometimes greater than this and sometimes
less. The change in rate from one step to
the next was very frequently, though not
always, below a noticeable difference.

** My thanks are due Mr. W. Rahm for this
circuit.

The pulse set up by the condenser and 
neon lamp circuit is amplified by the 
vacuum tube circuit and used to drive 
a sensitive relay (SR, Fig. 6). The sensi- 
tive relay will make and break once for 
every flash of the neon light. 

The current from a dry cell battery is 
run through the contact points of the 
relay to the primary of an inductorium. 
The current from the secondary of the
inductorium is used to operate the variable
stimulus (V, Fig. 6). Since a current
will be produced in the secondary of the 
inductorium on both the make and break 
of the sensitive relay, a condenser is 
placed in parallel with the secondary of 
the inductorium. This condenser will 
absorb the “make current” but allow the 
“break current” to pass. In other words 
the variable stimulus will flash only on 
the break of the sensitive relay. The vari- 
able stimulus was produced by a 3 watt 
neon lamp, of the type in which the 
positive and negative poles are opposed 
to each other so that the observer could 
only see the positive pole through the 
hole in the shield. The lamp was 
mounted behind a shield in which there 
was a hole about 3/4 of an inch in diameter.
The positive pole was toward the
opening in the shield. 

The observer on looking at the stimu- 
lus could see the brown shield with the 
grey of the positive pole of the neon 
lamp showing through the hole in the 
shield until the lamp flashed. When the 
lamp flashed the hole was filled with the 
orange-red glow of the lamp. 

The purpose of the inductorium in the 
circuit was to keep the duration of the 
flash of the lamp constant as the rate of 
the flashing was varied by means of the 
condenser system. 


The standard stimulus is controlled by 
a system that is almost identical. The 
chief difference between the circuits is 
that there are only 10 standard rates, so 
that the condenser system is much simpler
than that for the variable stimulus.

Fig. 7. Simplified wiring diagram of the variable
condenser circuit.


It was possible to use the same ampli- 
fying system for the standard that was 
used for the variable, as the standard 
and variable stimuli were not presented 
simultaneously. The fact that they were 
not presented simultaneously necessitat- 
ed a switching system so that the stand- 
ard and variable stimuli could be 
presented alternately for fixed time in- 
tervals. The duration of the standard 
and variable was either 12 seconds each 
or 8 seconds each. The alternation of 
the two stimuli and the duration of their 
presentation was controlled by a Volk- 
mann Timer (50). The circuit for con- 
trolling the standard stimulus and the 
Volkmann Timer circuit for controlling 
the alternation and the duration of the 
two stimuli are not shown in the diagram 


(Fig. 6). 
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‘The instrument was calibrated by con- 
necting a signal marker in series with the 
primary of the inductorium and record- 
ing the pulses on a fast moving smoked 
drum. A time line was recorded on the 
drum by a high speed signal marker 
driven by a transformer which was con- 
nected to the commercial AC line. The 
instrument was calibrated frequently 
and the changes in rate were found to be 
very small. 


2) Experimental Procedure 


The observer’s task was to set the rate 
of the variable stimulus so that it ap- 
peared to be flashing at one half the rate 
of the standard.

It is obvious that many extraneous 
cues would influence the observer if he 
manipulated the dials of the variable 
condenser. To obviate the necessity for 
the observer’s performing this operation, 
the experimenter manipulated the dials 
at the demand of the observer. Thus, the
observer would say “faster,” if he thought 
the variable was flashing at a rate less 
than one half the rate of the standard; 
and “slower,” if he thought the rate of 
the variable was more than one-half that 
of the standard. Since there was a fairly 
loud “click” each time one of the dials 
was moved one step, the observer soon 
learned to tell the experimenter by how 
many “clicks” the variable should be in- 
creased or decreased. In fact all observ- 
ers, after the first, were informed of the 
possibility of using the clicks, and all of 
them readily adopted this method as it 
was definite, straightforward and easy. 

In practice the observers would watch 
the standard, then the variable, and 
while the variable was still flashing, 
would say “Down five” or “Up three.” 
Sometimes they made two adjustments of 
the variable during one presentation but
they would usually wait until the standard
and variable had been presented
again. On the second and subsequent
presentations of the variable they would
continue instructing the experimenter as
before, until they had reached a satisfactory
1/2 adjustment.

The standards were presented in ran- 
dom order. The original rate at which 
the variable was presented with any 
standard was changed for every presentation
of that standard. Sometimes the
variable was started at a rate that was 
very much faster or slower than one-half 
the standard, at other times it was started 
at a rate that was fairly close to the usual 
one-half judgment of the observer, and 
at still other times it was started at inter- 
mediate positions. 

It was possible for the experimenter to 
produce the clicks without changing the 
adjustment or to produce a certain num- 
ber of clicks that did not correspond to 
the number of steps that the dials had 
been moved. Both of these ruses were oc- 
casionally tried on all the observers but 
neither of them ever seemed to upset or 
affect the final judgment of the observer. 

The observers were not given a set of 
formal instructions. The principle adopt- 
ed was that all of the observers should 
fully understand the task and the opera- 
tions they were to perform in accom- 
plishing it. However the following points 
were always stressed for each observer. 


1) The nature of the task.

2) The fact that the variable was always on
the right.

3) The fact that they must instruct the
experimenter to adjust the variable.

4) They were instructed to look at the
center of the openings in which the
lights flashed.

5) They were instructed not to count the
flashes of either the standard or the
variable.

6) They were instructed not to adopt any
kind of rhythmical movements, such as
tapping with the hands or feet, nodding
the head, etc.

7) They were told that they might rest
whenever they were tired.


The observers were seated at a distance 
of about two feet from the stimuli, which 
were set at an angle so that the shield 
formed an approximate right angle with 
the observer’s line of regard. A daylight 
lamp (60 watts) shone on the shield and 
the two stimuli. The daylight lamp was 
protected so that it did not shine in the 
observer’s face. The purpose of this lamp 
was to reduce the after-images of the 
stimuli which were distinctly noticeable 
in a completely darkened room. The 
lamp was effective in reducing the after- 
images to the extent that no observers 
seemed to notice them unless their atten- 
tion was directed to them. Even then the 
observers reported that the images were 
very slight. 

A trial series was given in which the 
observers were urged to ask questions 
concerning any portion of the task that 
might be disturbing them. 

Each experimental period lasted ap- 
proximately an hour, which usually in- 
cluded one rest period. Several times the 
sessions were somewhat longer. In this 
event another rest period was given. 

There were five observers, who may be 
designated as Du., Co., Lu., Re. and Gr. 


SECTION C. RESULTS 


1) General Results 


It was noted above that the duration 
of both the standard and variable was 
either 12 seconds each or 8 seconds each. 
If the standard flashed for 12 seconds, 
the variable would flash for 12 seconds; 
if the standard flashed for 8 seconds, the 
variable would flash for 8 seconds. The 
experimental sessions were begun with 
the expectation that the 12 second duration
would be used throughout the experiments.

It was found early in the first experi- 
mental session that the 12 second dura- 
tion was too long for the faster rates. 
The observers became impatient because 
the lamps were flashing fast enough for 
the observers to obtain a good idea of 
the rate in a few seconds’ time. However 
it was necessary to use the 12 second pres- 
entation time for the slower rates, be- 
cause if this were not done, the lamps 
would not flash a sufficient number of 
times for the observers to obtain a good 
idea of the rate. 

The question then arose: would the 4
second difference in the time of presentation
make a difference in the observers’
judgments? If the time did make a difference
it would be necessary to keep the
12 second duration throughout the experiment.
If it did not make any difference
it would be possible to use the
shorter duration of presentation for the
faster rates and the longer duration of
presentation for the slower rates. This
would not only make the task of the
observer simpler but would shorten the
total time of experimentation. This was
rather important as the observers could
only make about 10 complete judgments
an hour.

An attempt to answer the question of
the influence of the duration of presentation
on judgment was made in the
following way: the observer Gr. made
five judgments of 1/2 for each of 10
standard stimuli presented under the 12
second presentation and he also made
five judgments of 1/2 for the same 10
standard stimuli under the 8 second
presentation time. The mean 1/2 judgments
were then compared. The differences
between the means were tested by
the t test and none of the ratios reached
the 5 per cent level of significance. In
fact only one, that for the standard with
a rate of 10.82 per second, even reached
the 10 per cent level of significance.

A summary of these results will be
found in Table 1.


TABLE 1

Comparison of the means of the 1/2 judgments obtained under the
conditions of 8 and 12 second presentation time.

Rate of standard
in flashes per sec.      13.98   10.82    9.28    6.29    5.28    3.71    2.08    1.39    0.85    0.53

M of 1/2 judgments
for 8 sec.                8.07    7.82    6.82    4.25  [?].70    1.79    1.29    0.88    0.60    0.32

M of 1/2 judgments
for 12 sec.               8.33     [?]    6.81    4.04     [?]     [?]     [?]    0.97     [?]    0.30

Difference between
the M's                   0.26    0.90    0.01    0.19    0.01    0.01    0.14    0.09    0.01     [?]

$\sigma$ of the
differences              0.172   0.477   0.286   0.106   0.238   0.165   0.103   0.049   0.028   0.014

t                         1.51    1.89    0.03    0.97     [?]     [?]    1.38    1.84    0.35     [?]

p for 8 d.f.
(Fisher and Yates, 15)   >0.10   >0.05   >0.90   >0.30   >0.90   >0.90   >0.20   >0.10   >0.70   >0.20


The interpretation of p in the last row
of the table above is as follows: p = the
chances in 1.00 that a t as great or greater
than that obtained would occur from a
random sampling of a homogeneous
population. For example the t value obtained
for the difference between the
means of the 1/2 judgments of the 13.98
standard is 1.51. From the table of
probabilities under 8 d.f. (Fisher and
Yates, 15) it is seen that the probability
of this value lies between 0.2 and 0.1. In
other words there are more than 10 but
less than 20 chances in 100 that a t value
as great or greater than this could occur
in a random sampling of a homogeneous
population. Since none of the obtained t
values reach the 0.05 point, none of them
may be regarded as significant. In short
the chances are too great that they may
have arisen by chance. This of course
does not prove that the obtained means
were drawn from a homogeneous population
of means but it at least lends support
to the view that the differences in
the duration of the presentation of the
stimuli may be disregarded. Accordingly
it was decided to use the 12-second interval
for the slower rates and the 8 second
interval for the faster rates.
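
The reading of the probability table described above can be sketched in a few lines. This is an illustration only, not the author's computation: the function names are hypothetical, t is taken to be the difference between the means divided by the sigma of the difference, and the critical values are the usual two-tailed values of t for 8 degrees of freedom from a standard table.

```python
# Two-tailed critical values of t for 8 degrees of freedom, as (p, t) pairs,
# ordered from the largest tabled p to the smallest.
T_CRITICAL_8_DF = [(0.20, 1.397), (0.10, 1.860), (0.05, 2.306), (0.01, 3.355)]


def t_ratio(difference, sigma_of_difference):
    """Ratio of a difference between two means to the sigma of that difference."""
    return abs(difference) / sigma_of_difference


def p_exceeds(t, table=T_CRITICAL_8_DF):
    """Return the largest tabled p that the probability of t still exceeds.

    Returns None when t reaches even the smallest tabled level.
    """
    for p, critical_t in table:
        if t < critical_t:
            return p
    return None


# Entries for the 13.98 standard: difference 0.26, sigma of the difference 0.172.
t_13_98 = t_ratio(0.26, 0.172)   # about 1.51
bracket_13_98 = p_exceeds(t_13_98)

# Entries for the 10.82 standard: difference 0.90, sigma of the difference 0.477.
t_10_82 = t_ratio(0.90, 0.477)   # about 1.89
bracket_10_82 = p_exceeds(t_10_82)
```

With these inputs the 13.98 standard falls in the "greater than 0.10" bracket and the 10.82 standard in the "greater than 0.05" bracket, which agrees with the statement that only the latter reaches the 10 per cent level.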

Observers Gr. and Co. each made ten 
1/2 judgments for each of the ten 
standards. Observer Lu, made ten 1/2 
judgments for all standards except the 
slowest, on which she made seven 1/2 
judgments. Observer Re. made five judg- 
ments for all ten standards. Observer Du. 
made five judgments for all the standards 
except the slowest. 

The experimenter started the experiments
with the idea that at least ten 1/2
judgments would be needed to obtain
consistent results. In Table 2 will be
found the mean 1/2 judgments for all
the observers together with the $\sigma_M$ for all
the M** 1/2 judgments.


TABLE 2

The M 1/2 judgments for each standard for the five observers, together with the
$\sigma_M$, the $\sigma$ dist., the $\sigma_\sigma$, and V

Observer Gr.
Rate of standard       13.98   10.82    9.28    6.29    5.28    3.71    2.08    1.39    0.85    0.53
M 1/2 judgment          8.30    7.37    6.82    4.34     [?]     [?]     [?]    0.92    0.59    0.31
$\sigma_M$              0.12    0.32    0.20    0.14    0.17    0.12    0.07    0.03    0.02    0.01
$\sigma$ dist.          0.38    1.06    0.64    0.44    0.53    0.37    0.23    0.11    0.06    0.03
$\sigma_\sigma$         0.08    0.24    0.14    0.10    0.12    0.08    0.05    0.02    0.01    0.01
V                        4.7    14.4     9.3     [?]     [?]     [?]     [?]     [?]    10.7     [?]

Observer Co.
Rate of standard         [?]    9.84     [?]     [?]    4.53    3.16     [?]     [?]    0.98     [?]
M 1/2 judgment          9.10    6.84    5.82    2.87    2.03    1.42    0.96    0.69    0.41    0.29
$\sigma_M$              0.35    0.36    0.35    0.21    0.18    0.16    0.09    0.02    0.02    0.01
$\sigma$ dist.          1.12    1.14    1.11    0.67    0.56    0.52    0.28    0.09    0.07    0.04
$\sigma_\sigma$         0.25    0.26    0.25    0.15    0.12    0.12    0.06    0.02    0.01    0.01
V                       12.3    16.7    19.0    23.4    27.8    36.5     [?]     [?]    16.6     [?]

Observer Du.
Rate of standard       14.00   10.40  9.1[?]    6.25    5.26    3.70    2.04    1.41    0.61
M 1/2 judgment          8.85    6.72    5.27    3.54     [?]     [?]     [?]    0.86    0.44
$\sigma_M$              0.19    0.67    0.51    0.37    0.21    0.05    0.01    0.04    0.03
$\sigma$ dist.          0.43    1.51    1.14    0.84    0.46    0.11    0.02    0.08    0.06
$\sigma_\sigma$         0.14    0.48    0.36    0.27    0.15    0.03    0.01    0.03    0.02
V                        4.9    22.4    21.6    23.7    20.0     6.8     1.5     9.9    13.9

Observer Lu.
Rate of standard       13.98   10.82    9.28    6.29    5.28    3.71    2.08    1.39    0.85    0.53
M 1/2 judgment          8.69    7.37    6.10     [?]     [?]     [?]     [?]    0.94    0.55    0.32
$\sigma_M$              0.27    0.16    0.26    0.18    0.16    0.10    0.08    0.05    0.02    0.02
$\sigma$ dist.          0.84    0.52    0.83    0.57    0.51    0.33    0.26    0.14    0.07    0.04
$\sigma_\sigma$         0.19    0.12    0.19    0.13    0.11    0.07    0.06    0.03    0.02    0.01
V                        9.6     [?]     [?]     [?]     [?]     [?]     [?]     [?]     [?]     [?]

Observer Re.
Rate of standard       13.98   10.82    9.28    6.29    5.28    3.71    2.08    1.39    0.85    0.53
M 1/2 judgment          8.69  [?].89    6.58    4.48    2.87    1.95    1.24    0.84    0.55    0.28
$\sigma_M$              0.53    0.40    0.59    0.22    0.17    0.11    0.07    0.06    0.03    0.02
$\sigma$ dist.          1.19    0.90    1.33    0.50    0.37    0.25    0.17    0.14    0.07    0.02
$\sigma_\sigma$         0.37    0.28    0.42    0.16    0.12    0.08    0.05    0.04    0.02    0.01
V                       13.7    12.4    20.2     [?]     [?]    12.6    13.6    16.8    12.9     [?]

It should be noted not only that the
$\sigma_M$’s for the 1/2 judgments based on an N
of 10 are very small but also that the
$\sigma_M$’s for the 1/2 judgments based on an N
of 5 are very small. While it is true
that the $\sigma_M$’s based on an N of 5 tend to
be somewhat larger than those based on
an N of 10, it seems legitimate to conclude
that 5 judgments are sufficient to
obtain a reliable result. When the very
small N is considered it is obvious that
the 1/2 judgment of visual rate is extremely
consistent under the conditions
of this experiment.

Table 2 presents the relative variability,
V, and the $\sigma$ of the distributions
of 1/2 judgments for all observers, for
all standards.
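
The derived quantities in Table 2 appear to follow the conventional formulas, and the sketch below shows how they could be recomputed from a mean, the sigma of the distribution and N. The formulas are an assumption, since the text does not state them: sigma of the mean as sigma divided by the square root of N, sigma of sigma as sigma divided by the square root of 2N, and V as 100 times sigma divided by the mean. The function name is hypothetical.

```python
import math


def summary_statistics(mean, sigma_dist, n):
    """Recompute sigma_M, sigma_sigma and V from a mean, a sigma and N.

    Assumed formulas (not stated in the text):
      sigma_M     = sigma / sqrt(N)
      sigma_sigma = sigma / sqrt(2N)
      V           = 100 * sigma / M
    """
    return {
        "sigma_M": sigma_dist / math.sqrt(n),
        "sigma_sigma": sigma_dist / math.sqrt(2 * n),
        "V": 100.0 * sigma_dist / mean,
    }


# Observer Co., fastest standard: M = 9.10, sigma dist. = 1.12, N = 10.
co_fastest = summary_statistics(mean=9.10, sigma_dist=1.12, n=10)
rounded = {name: round(value, 2) for name, value in co_fastest.items()}
```

For this entry the recomputed values round to 0.35, 0.25 and 12.3, which agree with the tabled figures; small disagreements elsewhere would be expected because the tabled sigmas are themselves rounded.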

A graphical representation of the M 
1/2 judgments for the five observers is 
shown in Figures 8-12. In each figure the 
M of the 1/2 judgments has been plotted 
against its standard. The plots have been 
made on log-log coordinates, with a 
diagonal straight line indicating the 
objectively correct half. 


2) The Discontinuous Function 


When the points had been plotted an 
attempt was made to fit a curve to the 
points by eye. It was immediately obvious 
that the points could be best fitted by 
two curves rather than by one. The 
data seemed to be discontinuous. While 
the discontinuity seems plain, the pre- 
cise point at which the function breaks 
is not known. The point has been arbi- 
trarily defined as the point of inter- 
section of the two fitted curves. 

The evidence in support of this hy- 
pothesis may be summed up as follows: 


** Hereafter the symbol M will be used for the
mean.

a) Taking any of the curves sepa- 
rately it seems as though two negatively 
accelerated curves would fit the data 
better than any other single curve. That 
is, unless the single curve had a sharp 
flexion point, in which case the sharp 
flexion point would in itself be some 
evidence of discontinuity. In Figure 13
will be found the data for Re. plotted
on arithmetic coordinates (circles). Vertical
lines, whose length represents 1 $\sigma_M$,
have been drawn above and below their
respective M’s. A curve has been fitted to
the data by the method of least squares.
The equation for this curve is given by
the polynomial,

    $y = -0.31471 + 0.77586x - 0.00717x^2$.

It will be seen that this curve misses
seven of the obtained points by at least
1 $\sigma_M$ and misses four of the obtained
points by at least 3 $\sigma_M$.
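
How far the single curve misses a given point can be expressed in units of the sigma of the mean. The sketch below is illustrative: the function names are hypothetical, the last term of the polynomial is read as a square, and the two example points are the entries for Re. at the 5.28 and 2.08 standards.

```python
def single_curve(x):
    """Single least-squares polynomial quoted for the data of Re."""
    return -0.31471 + 0.77586 * x - 0.00717 * x ** 2


def miss_in_sigma_units(standard, mean, sigma_m):
    """Distance between the curve and an obtained mean, in units of sigma_M."""
    return abs(single_curve(standard) - mean) / sigma_m


# Re., standard 5.28: M = 2.87, sigma_M = 0.17 (a point near the break).
miss_near_break = miss_in_sigma_units(5.28, 2.87, 0.17)

# Re., standard 2.08: M = 1.24, sigma_M = 0.07 (a point the curve fits well).
miss_elsewhere = miss_in_sigma_units(2.08, 1.24, 0.07)
```

On these figures the curve lies more than four sigma units from the mean at the 5.28 standard and well under one unit from the mean at the 2.08 standard.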

In Figure 14 the same data have been
plotted in the same way. In this case,
however, two curves were fitted to the data
by the method of least squares. The equation
for the lower curve is given by the
polynomial,

    $y = -0.0818 + 0.7491x - 0.05437x^2$,

and the equation for the upper curve by
the polynomial,

    $y = -4.8853 + 1.83844x - 0.06202x^2$.

It will be seen that the two curves lie
within 1 $\sigma_M$ of every obtained point.

The data for Re. were chosen for this
demonstration because she had, on the
average, the largest $\sigma_M$’s.
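
Since the point of break has been defined as the intersection of the two fitted curves, it can be located by equating the two polynomials and solving the resulting quadratic. The following is a sketch under stated assumptions: the last term of each polynomial is read as a square, the function name is hypothetical, and only roots inside the range of standards used for Re. (0.53 to 13.98 flashes per second) are kept.

```python
import math

# Coefficients (constant, linear, quadratic) of the two fitted curves for Re.
LOWER_CURVE = (-0.0818, 0.7491, -0.05437)
UPPER_CURVE = (-4.8853, 1.83844, -0.06202)


def crossing_points(p, q, lowest, highest):
    """Return the x values in [lowest, highest] where two quadratics are equal."""
    c = p[0] - q[0]
    b = p[1] - q[1]
    a = p[2] - q[2]
    discriminant = b * b - 4 * a * c
    if a == 0 or discriminant < 0:
        return []
    root = math.sqrt(discriminant)
    candidates = [(-b - root) / (2 * a), (-b + root) / (2 * a)]
    return sorted(x for x in candidates if lowest <= x <= highest)


breaks = crossing_points(LOWER_CURVE, UPPER_CURVE, 0.53, 13.98)
```

Under these assumptions a single crossing falls within the range, between the 3.71 and 5.28 standards.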

This, of course, does not mean that 
the obtained points could not be fitted 
by a single curve merely by adding terms 
to the polynomial or by some other equa- 
tion; it does show amply, if indeed it 
was not already obvious, that no con- 
tinuous smooth curve without a sharp 
flexion point could adequately fit the 
data.

b) Further evidence for discontinuity
is given by the fact that all the observers 
have curves of the same shape and all of 
them show the same evidence of dis- 
continuity. If the data were truly con- 
tinuous, it is reasonable to suppose that 
the function for at least one observer 
would appear to be continuous. 

This argument becomes particularly 
potent when the small $\sigma_M$’s are taken
into account. A glance at the plotted $\sigma_M$
for the data for Re. in Figure 13 or 14 
should convince anyone that there is 
only a very small chance that the true 
means near the point of break could 
differ by much from those actually 
obtained. 

c) Further evidence for discontinuity 
is given by the introspections of the ob- 
servers. All of the observers found the 
slower rates harder to judge than the 
faster rates. Despite the consistency of 
judgments, all of the observers reported 
low confidence for all the judgments, 
but for the slower rates even this low 
confidence seemed to disappear and the 
observers often thought that they were 
simply guessing. The experimenter noted 
that a greater number of presentations 
of the standard and variable was neces- 
sary at the slower rates before the ob- 
server was able to come to a judgment. 

All of this evidence points to the sup- 
position that the task of the observer at 
the slower rates was different from the
task at the faster rates. The only evi- 
dence of the nature of this difference 
so far is that the task is more difficult, 
that it takes a longer time and is ac- 
companied by less certainty. 

But all of the observers reported that 
they found the slower rates very “dif- 
ferent” from the faster rates. When asked 
if they had adhered to the instructions 
and judged rate, and not the time between
the flashes, all of them said that
they had or thought that they had.

Observer Co., however, reported that 
at one place in the series he seemed to 
base his “rate” judgment on the speed 
of the flashes and at the slower rates he 
seemed to base his “rate” judgments on 
“rhythm.” It would appear that the 
observer was judging different discrimin- 
able characteristics of the same stimulus 
variable. 

The experimenter then ran through 
the standard series alone and asked Co. 
to tell him when he came to the first 
standard that was judged on the basis of 
“speed.” The experimenter started at 
the slower rates and presented each 
standard in succession. When he came 
to the standard of 4.53 per second Co. 
said that it was the first that was judged 
on the basis of speed. By referring to the 
graph for this observer, Figure 8, it can 
be seen that this is the first standard 
above the point of break. The experi- 
menter then started at the faster rates 
and ran through the series and asked Co. 
to tell him when he came to the first 
standard that was judged on the basis 
of “rhythm.” When the experimenter 
came to the standard of 3.16 per second 
Co. said that it was judged on the basis 
of rhythm. By referring to Figure 8 again 
it will be seen that this is the first stand- 
ard below the point of break. 


Furthermore observer Du. reported a
very real difference in his perception of
the slower and faster rates. When asked 
if the words “speed” and “rhythm” 
could be used to describe the difference 
between the two perceptions, the ob- 
server said that they could. He was not 
so sure that the word “rhythm” was as 
adequate to describe the slower rates as 
the word “speed” was to describe the
faster ones. The experimenter presented 
the standards to this observer as he had 
for Co. Only a descending series was 
given. The observer reported that the
rates of 13.98, 10.82 and 9.28 could only 
be seen as “speeds,” and that the rates 
of 6.29 and 5.21 could sometimes be seen 
as “rhythms” but were usually seen as 
speed and that below this point all of 
the standards were seen as rhythms. The 
break in Du.’s function occurs between
5.21 and 3.71.

It will be noted that over a certain 
range the standard would appear as a 
“speed” and the variable as a “rhythm.” 
For these observers, Co. and Du., the 
average 1/2 judgments for the three 
fastest rates are seen as speeds. The 
average 1/2 judgments for the next two 
standards are seen as rhythms. In other 
words, the standards are seen as one 
thing and the variables as another. Since 
the break in the function occurs below 
this point (the point of discontinuity), 
both the standards and the rates that 
were judged 1/2 are seen as rhythms. 

Co. claimed that when the standard
was seen as a “speed” the comparison 
stimulus also tended to be seen as a 
speed. 

The evidence above seems to point 
clearly to the fact that the observers are, 
in reality, judging two different charac- 
teristics of the flashing lamp stimulus. 
The observers are judging a “speed”
when the lamp is flashing at a rate faster 
than 4.00+ per second; and they are 
judging “rhythms” when the lamp is 
flashing at a rate slower than 4.00+ per 
second. 

The interpretation of the words 
“speed” and “rhythm” will be left to a 
later section. 

d) Further evidence of discontinuity 
is offered by the relative variability of 
the 1/2 judgments, which changes char- 
acteristically at the point of discon- 
tinuity. 


The coefficient of variability, V, for
the 1/2 judgments has been plotted
against the standard rates for all the
observers. These plots are seen in
Figures 15-19. The point where the 1/2
judgment plots break is indicated in
these figures by a caret near the
abscissa. The points have been plotted
on semi-logarithmic coordinates.
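As a minimal sketch of how such a relative variability value might be computed, it is assumed here that V is the usual coefficient of variation (100 times the standard deviation divided by the mean). The formula is not spelled out in this passage, and the judgment values below are hypothetical.

```python
from statistics import mean, pstdev

def coefficient_of_variability(judgments):
    """Return V = 100 * sigma / mean for a list of half judgments.

    Assumption: sigma is the population standard deviation of the
    judgments; the formula actually used in the study may differ.
    """
    m = mean(judgments)
    if m == 0:
        raise ValueError("mean of judgments is zero; V is undefined")
    return 100.0 * pstdev(judgments) / m

# Hypothetical half judgments (flashes per second) for one standard.
example_judgments = [2.0, 2.2, 1.8, 2.1, 1.9]
v_example = coefficient_of_variability(example_judgments)
```

Because V is a ratio, it allows the scatter of judgments at slow and fast standards to be compared on a common footing.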

It will be noted that all of these func- 
tions, except one, are strikingly similar 
in at least one respect. There is a large 
increase in relative variability at and 
around the point of break, followed by 
a sharp decrease. 

This sharp increase in relative varia- 
bility, followed by a decrease, could be 
due to several things. One possible cause 
would be the confusion caused the ob- 
server by the perception of the standard 
as a “speed” and the variable as a
“rhythm.” If this were the cause it might 
be expected that the highest variability 
would occur only in the 1/2 judgments 
of the two slowest “speed” standards. 
This, however, is not generally the case. 

Another rather obvious possibility is 
that the standards near the point of 
break are sometimes seen as rhythm and 
sometimes as speed. Furthermore the 
variable for the two slowest “speed”
standards might sometimes be seen as a 
“rhythm” and sometimes as a “speed.” 
The introspections of Du. lend a little 
support to this theory. He noted, it will 
be remembered, that at least two of the 
standards could be seen both ways, al- 
though it was more natural for him to 
see them as “speeds.” 

It was said above that four of the 
graphs were similar, at least in that they 
showed a sharp rise in relative variability 
at the point of break and a drop after 
the point of break. If the V functions 
of the observers Gr., Co. and Lu. are
examined it will be seen that they are
similar in showing a rather regular increase
in relative variability as the break
is approached and a rather regular de- 
crease after the point of break. Du., 
while he shows an increase, does not 
show the same gradual increase, nor the 
same gradual decrease after the break. 
Re. shows neither the increase nor the 
decrease. The experimenter knows no 
reason why these observers do not show 
the same functions as the other observers,
unless it be that the σ’s of these
two are based on an N of 5, whereas the
σ’s of the other observers are based on an
N of 10. In short it is possible that these
two differ from the “true” function because
of the relatively less reliable σ’s.
There is a greater chance that their σ’s
are not the true σ’s.
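The effect of N on the reliability of a standard deviation can be illustrated with the common large-sample approximation that the standard error of σ is about σ/√(2N). This formula is an assumption introduced for illustration only; it is not stated in the text, and it is only rough for samples as small as 5 or 10.

```python
import math

def standard_error_of_sigma(sigma, n):
    """Approximate standard error of a standard deviation: sigma / sqrt(2N)."""
    if n <= 0:
        raise ValueError("n must be positive")
    return sigma / math.sqrt(2 * n)

# The same hypothetical sigma, estimated from 5 and from 10 judgments.
se_n5 = standard_error_of_sigma(1.0, 5)
se_n10 = standard_error_of_sigma(1.0, 10)

# Ratio of the two standard errors; under this approximation it is sqrt(2).
ratio = se_n5 / se_n10
```

Under this approximation a σ based on 5 judgments is about 1.4 times less precise than one based on 10.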

An examination of the σ_σ’s lends little
support to this explanation. Both Du.
and Re. have relatively larger σ_σ’s for the
judgments for the faster rates but, on
the other hand, the σ_σ’s for the slower
rates are approximately the same as those 
of the other observers. 

If the true shape of the relative varia- 
bility function is that shown by Gr., Co. 
and Lu., support would be given to the 
hypothesis that the stimuli at the point 
of break are sometimes seen as speeds 
and sometimes seen as rhythms. The 
further the stimulus is from the point of 
break the greater is the likelihood that 
it will always be seen as either one or 
the other and not as a mixture of both. 

It is obvious that the support of the 
hypothesis of discontinuity given by the 
relative variability data is based on an 
assumption. The assumption is that the 
relative variability would not first in- 
crease, then decrease, in the manner 
shown in Figures 15-19 if the data were 
continuous. It seems fairly safe to make 
this assumption. While it is true that, in 
advance of knowledge, the relative varia- 
bility function for a continuous 1/2 


judgment function might be of any 
shape, it is improbable that it would 
show any sharp break. 


3) The Construction of Magnitude 
Functions for Discontinuous Data 


The construction of magnitude func- 
tions from discontinuous 1/2 judgment 
functions presents two types of special 
problems not found in the construction 
of the functions from continuous 1/2 
judgment functions. The first problem 
is theoretical and the second is practical. 

a) Theoretical. The first question that 
must be answered if the 1/2 judgment 
function is discontinuous is: should there 
be two magnitude functions or one? 
Campbell (6) claims that magnitudes are 
the same if the order generated between 
the systems is the same, regardless of the 
fact that the two orders are established 
by different operations. If the magnitudes 
are additive, they are the same magnitudes
or are magnitudes of the same
kind, if the operation for addition is the
same. Campbell says that the magnitudes
are the same or that they are magnitudes 
of the same kind because of some com- 
mon property that determines the order 
in both cases. 

It is necessary to examine this proposi- 
tion of Campbell's further. Take for ex- 
ample the work of Taves (44) who has 
constructed a magnitude function for 
numerousness. If he had also constructed 
a magnitude function for density, with 
area constant, it is obvious that the order 
between the systems would be the same 
order as that for numerousness. Camp- 
bell would say that numerousness and 
density were the same magnitudes or 
magnitudes of the same kind. Yet Taves
argues that density and numerousness 
are different magnitudes. He bases this 
argument on the fact that the 1/2 judg- 
ment function for density obtained 
under the conditions of his experiment
has a different shape from the 1/2 judg- 
ment function for numerousness. Is there 
a contradiction between Taves’ position 
and that of Campbell? 

It is true that these magnitudes are 
magnitudes of the same kind in the 
sense in which Campbell uses “same.” 
Numerousness is a function of the number
of stimulus dots; density (area constant)
would also be a function of the
number of stimulus dots. There is a
common factor in the two series, al- 
though the subjective operations are dif- 
ferent. In one case the observer is in- 
structed to judge “numerousness” and, 
in the other, “density.” 

But the magnitudes are different in
another sense. To construct a hypo-
thetical example from physics, suppose
that two B-magnitude scales have been
constructed for temperature, one by
means of the mercury thermometer and
another by any other method. The order
of the systems scaled by the two opera-
tions will be the same. However the
functional relation between the two
scales and a third measurable magnitude
may be different. One might have a
linear relation to some third magnitude
and the other might have an exponential
relation to the same magnitude; in other
words there may not be a linear relation
between the two B-magnitudes.

Figs. 15-19. Relative variability of the half
judgments as a function of the rate of the
standard (semi-logarithmic coordinates) for
five observers. The abscissa represents ob-
jective rate, flashes per second, of the stand-
ard and the ordinate represents the coeffi-
cient of variability. The point of discontinu-
ity as determined from the half judgment
function is indicated by a caret.

The magnitudes are the same in the 
sense that the order is the result of a 
common property—temperature. The 
fact that the relation between the two 
magnitudes is not linear points to the 
fact that they must be different in some 
other sense. The order obtained is not 
only a result of the common factor but 
also of the operations used to construct 
the scale. The sameness of the order can 
be said to be due to the common factor; 
the lack of linearity can be said to be 
due to characteristics of the operations. 
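The distinction between sameness of order and lack of linearity can be sketched numerically. The two scales below are hypothetical: one is taken to be linear and the other exponential in an arbitrary third magnitude, as in the temperature example, and all constants are invented for illustration.

```python
import math

# Hypothetical third magnitude (arbitrary units) for five systems.
third_magnitude = [1.0, 2.0, 3.0, 4.0, 5.0]

# Scale A is linear in the third magnitude; scale B is exponential in it.
scale_a = [2.0 * x + 1.0 for x in third_magnitude]
scale_b = [math.exp(0.5 * x) for x in third_magnitude]

def rank_order(values):
    """Indices of the systems sorted from smallest to largest value."""
    return sorted(range(len(values)), key=lambda i: values[i])

def is_linear(xs, ys, tol=1e-9):
    """True if the successive slopes between (x, y) points are all equal."""
    slopes = [(ys[i + 1] - ys[i]) / (xs[i + 1] - xs[i]) for i in range(len(xs) - 1)]
    return all(abs(s - slopes[0]) <= tol for s in slopes)

# The two scales order the systems identically ...
same_order = rank_order(scale_a) == rank_order(scale_b)
# ... but the relation between the two scales is not linear.
linear_between_scales = is_linear(scale_a, scale_b)
```

In this sketch the common order stands for the common property, and the curvature between the scales stands for the difference in the scaling operations.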


To return to numerousness and density
(area constant), in one case the observer
is asked to judge the perceived
number of dots and in the other case 
he is asked to judge the perceived num- 
ber of dots per subjective unit area. The 
magnitudes are the same in Campbell's 
sense but they are different in the sense 
that the operation performed on the 
stimuli in the case of numerousness does 
not include the characteristics of sub- 
jective unit area. 

This should give the clue for answer- 
ing the question of whether there should 
be two magnitude functions or one mag- 
nitude function, when the 1/2 judgment 
data is discontinuous. There might be 
two ways of telling whether discontinuity is
evidence for two magnitudes: 


1) It is possible to check the hypoth- 
esis that two magnitudes exist by chang- 
ing the instructions. If the observers 
report that at one time they are judging 
“rhythm” and at the other “speed,” it is 
possible to instruct them concerning the 
nature of the two characteristics and tell 
them to be sure to judge only the 
“rhythm” characteristic until it was abso- 
lutely essential to shift to “speed.” Then 
the process would be reversed; the ob- 
server would be instructed to judge only 
the “speed” characteristic until it was
absolutely necessary to change to 
“rhythm.” If the change in instructions 
resulted in a shift of the point of break 
it could be presumed that there were two 
magnitudes. 

2) Evidence of the existence of two 
characteristics could be gained from the 
verbal responses of the observers. If they 
describe two separate characteristics and 
the verbal description checks with the 
obtained data it may be presumed that 
there are two magnitudes. 

b) Practical. If two magnitude func- 
tions are constructed there is no real 
practical problem. The functions may be 
constructed on the same coordinates or 
on different coordinates. There will be 
a separate point of origin for each curve. 

If a single magnitude function is neces- 
sary the method is somewhat more com- 
plicated. 

In Figure 20 the solid line represents
some hypothetical 1/2 judgment data
plotted on log-log coordinates. The form
of the function and the type of discon-
tinuity is the same as that for numerous-
ness (Taves, 44). After assigning the
numeral 1 to the physical stimulus value
1, successive points of the magnitude
function may be plotted in the usual
way. The resulting curves will be those
shown by the solid lines in Figure 21.
The type of break shown in Figure 20
demands two values for the stimulus of
7 physical units. From Figure 20 it can
be seen that 2.9 has been judged to be
1/2 of 7, and 4 has been judged to be
1/2 of 7. Therefore 7, which is the point
of break, must have two subjective mag-
nitudes. The question immediately
arises: is this a practical possibility? If
the observer judged 2.9 to be 1/2 of 7
half the time, and judged 4 to be 1/2
of 7 half the time, the mean 1/2 judg-
ment for 7 would lie half way between
2.9 and 4. This is perfectly true. The only
reason the break has been so drawn is to
simplify the technique. Actually the
break in the function will be in the
nature of a steep curve between the
values of, say, 6 and 8. However this
does not change the basic principles that
are being discussed.

Fig. 20. Hypothetical half judgment function showing discontinuity
(see text). The coordinates are logarithmic. Axis labels: judged one
half (physical units); standard (physical units).

Fig. 21. Magnitude function showing discontinuity (see text).
The coordinates are logarithmic.

Fig. 22. Magnitude function showing multiple discontinuity (see
text). The coordinates are logarithmic.
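The plotting of successive points "in the usual way" can be sketched for the simpler case of a continuous half judgment function. The function `hypothetical_half` below is invented for illustration (the variable judged 1/2 of a standard s is taken to be 0.4 s), and the helper names are not from the text. Each step finds the standard whose half judgment equals the previous stimulus and assigns it twice the previous subjective magnitude.

```python
def inverse_monotone(f, target, lo, hi, iterations=200):
    """Find s in [lo, hi] with f(s) == target for an increasing f (bisection)."""
    for _ in range(iterations):
        mid = (lo + hi) / 2.0
        if f(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

def magnitude_points(half, start_stimulus, start_value, steps, upper_bound):
    """Chain of (stimulus, subjective magnitude) points.

    Each step finds the standard whose half judgment equals the previous
    stimulus and assigns it twice the previous subjective magnitude.
    """
    points = [(start_stimulus, start_value)]
    stimulus, value = start_stimulus, start_value
    for _ in range(steps):
        stimulus = inverse_monotone(half, stimulus, stimulus, upper_bound)
        value *= 2.0
        points.append((stimulus, value))
    return points

def hypothetical_half(s):
    """Hypothetical half judgment function (an assumption, not the study's data)."""
    return 0.4 * s

# The numeral 1 is assigned to the physical stimulus value 1.
points = magnitude_points(hypothetical_half, 1.0, 1.0, 3, 1e6)
```

With this hypothetical function the chain runs through the stimuli 1, 2.5, 6.25 and 15.625 with subjective magnitudes 1, 2, 4 and 8. Intermediate values would be obtained by interpolation between such points.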

In the discussion in Part I it was stated 
that the magnitude function served two 
purposes, first, to describe the functional 
relation between the subjective and ob- 
jective magnitudes; second, to permit 
estimates of intermediate magnitudes by 
interpolation. 

It was not mentioned that there is an 
objective check on this interpolated esti- 
mate. For example the stimulus of 8.5 
physical units has a subjective magnitude 
of 28. The physical magnitude associated 
with 1/2 of this subjective magnitude is 
5.0. Therefore 5.0 ought to have been 
judged to be 1/2 of 8.5. Turning to the
1/2 judgment function it is seen that
5.0 was not judged to be 1/2 of 8.5 but
of 10.5. 
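The objective check just described can be written out as a small lookup, using only the values quoted in the example. The dictionary and function names are illustrative and are not part of the original method.

```python
# Values quoted in the text for the hypothetical functions.
magnitude_of_stimulus = {8.5: 28.0}   # stimulus -> subjective magnitude
stimulus_of_magnitude = {14.0: 5.0}   # subjective magnitude -> stimulus
standard_for_half = {5.0: 10.5}       # variable -> standard it was judged 1/2 of

def check_half(standard):
    """Return (predicted half stimulus, standard actually paired with it)."""
    half_magnitude = magnitude_of_stimulus[standard] / 2.0
    predicted_variable = stimulus_of_magnitude[half_magnitude]
    observed_standard = standard_for_half[predicted_variable]
    return predicted_variable, observed_standard

predicted_variable, observed_standard = check_half(8.5)

# The magnitude function passes the check only if the two standards agree.
consistent = observed_standard == 8.5
```

Here the interpolated estimate fails the check, since the stimulus predicted to be 1/2 of 8.5 was in fact judged 1/2 of 10.5.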

It seems then that something is wrong 
with the magnitude function as constructed.
It is in a sense not internally
consistent when tested against the 1/2
judgment function. The first segment of
the curve, that going from 1 to 7 physical
units, is internally consistent. If the remainder
of the magnitude function is
actually plotted from the lower segment
of the curve, i.e., from 1 to 7, rather than
by simply drawing the best fitting curve
between the obtained points, the following
operations would be performed:
Going to the 1/2 judgment function it is 
seen that 5.4 was judged to be 1/2 of 12 
and, therefore, 12 should be assigned 
twice the subjective magnitude that was 
assigned to 5.4, and so on. The result 
will be 5 discontinuous functions. These 
functions are shown in Figure 22 by the 
solid curves. The magnitude function 
shown in Figure 21 has been drawn in 
in dotted lines. The amount by which 
the five discontinuous curves deviate 
from the magnitude function constructed 
in the usual manner is a measure of the 
lack of consistency of the function shown 
in Figure 21. 

The function consisting of the five 
discontinuous curves truly reflects the 
relation between subjective magnitude 
and physical magnitude. The function 
is also internally consistent. Yet it would 
seem that there ought to be only one 
break in the magnitude function. There 
is only one break in the 1/2 judgment 
function. There ought to be no sudden 
jump in subjective magnitude at 18.5 
physical units nor at 95 physical units, 
etc. The observer gives no indication of 
this in his actual judgments, as reflected 
in the 1/2 judgment function. On 
examination it is clear that the function 
in Figure 22 breaks as a multiple of 7, 
the original point of break. 7 was judged 
to be 1/2 of 18.5 and the function breaks 
at 18.5; 18.5 was judged to be 1/2 of 95 
and the function breaks at 95; 95 was 
judged to be 1/2 of 1,430 and the func- 
tion breaks at 1,430. 

In other words the successive breaks 
were necessitated by the fact that there 
was a break at 7. 
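The way in which one break generates the others can be traced with the half judgment pairs quoted above. This is only a sketch of the bookkeeping; the names are illustrative.

```python
# Half judgment pairs quoted in the text: variable -> standard it was judged 1/2 of.
standard_for_half = {7.0: 18.5, 18.5: 95.0, 95.0: 1430.0}

def propagated_breaks(first_break, pairs):
    """Follow the chain of standards starting from the original point of break."""
    breaks = [first_break]
    while breaks[-1] in pairs:
        breaks.append(pairs[breaks[-1]])
    return breaks

breaks = propagated_breaks(7.0, standard_for_half)
```

Each later break is the standard of which the preceding break was judged 1/2, so that all of them trace back to the single break at 7.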

In the discussion of the discontinuity 
of the visual rate function it was mentioned
that over a certain range the
standard would be perceived as a “speed” 
and the variable as a “rhythm.” Over the 
rest of the range the standard and 
variable would be perceived as a 
“rhythm” and over another range they 
would both be perceived as a “speed.” 
If this fact is applied to the hypothetical
data in Figure 20, it will be seen
that the range in which the standard 
and variable are perceived differently 
must extend from 7 to 18.5. If this is so, 
is it reasonable that the curve from 7 to 
18.5 is continuous with the curve from 
18.5 to 4,000? It would seem that this is 
impossible. Since the effect of the break 
is to cause an increase in subjective mag- 
nitude, when the standard is a “speed” 
and the variable a “rhythm,” the ob- 
server must select a variable that has a 
magnitude relatively greater than that 
which he selected when both the stand- 
ard and variable lie in the upper range. 
The final answer to the problem is 
now clear. The hypothetical data in 
Figure 20 could never have occurred. 
If there is one break in the 1/2 judgment
there must be another break. The
necessary shape of this middle segment 
is shown by the dotted line in Figure 20. 
It must be remembered that this shape 
is dependent upon the form of the break 
at 7, and that the form shown was used 
for illustrative purposes only. Actually 
it is reasonable to expect that the break 
will never be as sharp as the one shown. 
The result of the double break is to give 
a magnitude function that has only one 
discontinuity, that truly represents the 
relation between the subjective magni- 
tude and the stimulus magnitude, and 
that is internally consistent. 
This function is that shown in Figure 
21 and by the dotted lines in Figure 22. 
The evidence points to the fact that
the break in the 1/2 judgment function
for visual rate is due to the judgment of 
two different characteristics or, in other 
words, that there are two different mag- 
nitudes. Therefore two magnitude func- 
tions have been constructed. The origin 
of the first function is taken at 50 on the 
ordinate and 0.5 on the abscissa. That is 
to say, in constructing the magnitude
function 50 subjective units were arbi- 
trarily assigned to a rate of 0.5 per sec. 
The origin of the second function has 
been taken on the abscissa to be the point 
of break of the 1/2 judgment function; 
and on the ordinate to be the subjective 
magnitude assigned to the rhythm mag- 
nitude at the point of break. The points 
of origin of the second function, in terms 
of abscissa and ordinate, respectively, 
were for Co. 4.3 and 982; Lu. 2.65 and 
700; Du. 4.09 and 620; Re. 4.8 and 800; 
Gr. 4.7 and 1,000. The two curves have 
been extrapolated for a short distance 
so that they intersect each other. This 
was done to emphasize the fact that they 
are functions for different magnitudes. 

The magnitude functions for each of 
the observers are shown in Figures 23-27. 
The curves are plotted on log-log co- 
ordinates. 


SECTION D. DISCUSSION OF RESULTS 


Dunlap (10), using both visual and 
auditory stimuli and employing the 
method of constant stimuli, obtained the 
difference limen for both time and rate. 
Under one condition the observers 
judged the time between the stimuli and 
under the other they judged the rate of 
the stimuli. He found the difference 
limen for two standard stimuli for one 
of which the stimuli were presented at 
0.232 second intervals (a rate of 4.31) 
and for the other the interval was 0.4355 
seconds (a rate of 2.296). 
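The rates given in parentheses follow from the intervals if rate is taken as the reciprocal of the interval between successive stimuli. That reading is an assumption, but it reproduces the figures quoted from Dunlap.

```python
def rate_from_interval(interval_seconds):
    """Stimulus presentations per second for a given inter-stimulus interval."""
    if interval_seconds <= 0:
        raise ValueError("interval must be positive")
    return 1.0 / interval_seconds

dunlap_intervals = [0.232, 0.4355]  # seconds, as quoted from Dunlap
dunlap_rates = [round(rate_from_interval(t), 3) for t in dunlap_intervals]
```

The same conversion allows Dunlap's time standards to be placed on the rate scale used in the present experiment.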

He found that the sensitivity for rate
was greater than for time. The psycho-
metric functions for rate were not only
steeper than those for time but they
were more regular. Twenty to forty
judgments gave a smooth curve for rate
whereas the time curves were not smooth
even when more than forty judgments
were made. Dunlap concludes from this
(p. 51) that, “These differences favor the
supposition that the rate-judgment is not
essentially a judgment of the interval
between stimulations.”

Figs. 23-27. Magnitude functions for visual rate for five observers.

Fig. 28. Psychometric function for temporal
intervals (from Dunlap, 10). The abscissa rep-
resents temporal intervals, in σ, between two
auditory stimuli. A scale of rate, sounder clicks
per second, for the corresponding temporal inter-
vals has been added to facilitate comparison with
the present study. The ordinate represents the
percentage of times a given interval was judged
to be longer than the standard interval of 232σ.

He then points out the possibility that 
the irregularities in the time functions 
are not due to poor judgment on the 
part of the observers. The irregularities 
in the curves are strikingly similar. They 
occur in 8 of the functions, 6 of which 
are for auditory stimuli separated by 
0.232 sec. (rate of 4.31). Dunlap did not 
use visual time judgments so no data is 
available for the judgment of time intervals
that are “bounded” by visual stimulations.
The other two curves in which
these irregularities occur are for stimuli 
separated by 0.4355 sec. (rate of 2.296). 
One is for the judgment of visual rate 
and the other for an auditory time 
judgment. 

An example of one of Dunlap’s curves 
(Dunlap 3, I) is shown in Figure 28. 
Dunlap calculated his limen in an un- 
usual manner. 

But the more usual plot may be 
readily obtained. The number of greater 
or less judgments can be found by solv- 
ing simultaneously the equations, 

G + L = N
G - L = N′.
G refers to the number of judgments 
of “Greater” and L to the number of 
judgments of “Less.” N refers to the total 
number of judgments. N’ is the number 
of times judgments of “Greater” exceeded 
judgments of “Less.” One can then plot 
the percentage of greater judgments 
against the variable and obtain a psycho- 
metric function in accord with common 
practice. Figure 28 shows the percentage 
of greater judgments (i.e., judgments 
that the time between the stimuli of the 
variable was longer than that between 
the standard stimuli) plotted against the 
stimulus variables. The nature of the 
irregularity is clearly seen. There seem 
to be two distinct sigmoid functions. It
will be remembered that the standard
rate was 4.31 per sec. In six of the curves
obtained by Dunlap the standard was
presented at this rate and all of the 
breaks in his functions occurred between 
the rates of 4.54 and 4.03. 
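The two simultaneous equations for G and L given above have the solution G = (N + N′)/2 and L = (N - N′)/2. A minimal sketch follows; the counts used are hypothetical, and it is assumed, as the equations imply, that every judgment was either "Greater" or "Less."

```python
def greater_less_counts(n_total, n_excess):
    """Solve G + L = N and G - L = N' for G and L."""
    g = (n_total + n_excess) / 2.0
    l = (n_total - n_excess) / 2.0
    return g, l

def percent_greater(n_total, n_excess):
    """Percentage of 'Greater' judgments, as plotted in a psychometric function."""
    g, _ = greater_less_counts(n_total, n_excess)
    return 100.0 * g / n_total

# Hypothetical counts: 40 judgments, "Greater" exceeding "Less" by 10.
g_count, l_count = greater_less_counts(40, 10)
pct = percent_greater(40, 10)
```

Repeating the computation for each value of the variable gives the points of the conventional psychometric function.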

By referring to Figures 8-12 it will be 
seen that four of the five 1/2 judgment 
functions obtained in the present experi- 
ment broke between the rates of 4.95 
and 4.18. From the quotation from Dunlap
given above he seems to have suspected
that the “irregularity” of his
psychometric functions for time might 
be due to something more than unre- 
liable discrimination. The analysis of 
the data obtained by Dunlap for the 
time judgment gives added support to 
the contention that there is normally a 
change from the discriminable charac- 
teristic “speed” to that of “rhythm” (the 
meanings of these two words are ex- 
panded below) in the neighborhood of 
4.00-5.00 stimulus presentations per sec.

In the section above introspective evi- 
dence was offered to support the hypoth- 
esis that the judgments of the faster 
and slower rates were based on different 
discriminable characteristics. 

These characteristics were named 
“rhythm” and “speed” by the observer 
Co. It is now necessary to attempt to 
interpret these two terms. One thing 
was certain from the beginning: “rhythm”
was not used by Co. with the usual con- 
notations. When Re., Du., Co. and Gr. 
were asked to verbalize the two charac- 
teristics more fully, they replied in sub- 
stance that: “When the light flashes at 
a rapid rate it seems to be there all the 
time. When the light is flashing at a 
slower rate it is not there all the time. 
There are blank intervals bounded by 
the flashes.” 

Tinker (49) has stated the difference 
between rapid and slow rates as follows, 
“The rate, however, must not be too 
slow, for in that case the perception of 
the interval between the qualities or
events tends to intrude and to become
the characteristic feature of the percep- 
tion, and the succession as such becomes 
difficult to perceive.” 

The experimenter believes that the 
so-called “speed” judgment was really a
judgment of “speed” or “rate” but that
the so-called “rhythm” judgment was a
judgment of the interval between the
flashes. The question now arises, why do 
the observers cease judging on the basis 
of rate at the particular point where 
they do? 

Dunlap (10) has reported that when 
flashes of light are of equal length the 
smallest perceptible interval between the 
flashes ranges from 0.004 sec. to 0.0198 
sec. The durations of the flashes of light 
ranged from 0.0198 to 0.0729. The 
shorter time intervals between the flashes 
were associated with the longer dura- 
tions. Dunlap notes that intensity is an 
important variable in this determination. 

These intervals were extremely small. 
If the observers are able to perceive time 
intervals as small as this, why do the 
observers in the present experiment 
change from a time to a rate judgment 
when the durations are approximately 
0.222? The experimenter believes that 
the answer has been given by James (27): 
“To be conscious of a time interval at 
all is one thing; to tell whether it be 
shorter or longer than another interval 
is a different thing.” In short when the
interval reaches a certain duration any 
further decrease is imperceptible. The 
subjective magnitude of time becomes 
asymptotic to the absolute threshold and 
the subjective magnitude of rate will 
become asymptotic to the upper limit. 
As the flashes increase and approach the 
fusion point the subjective magnitude of 
rate should be negatively accelerated. It 
will be asymptotic to the fusion point. 

This does not mean that the observers 
in the present experiment judged on the 
basis of time as long as they could. It is 
the opinion of the experimenter that 
they judged on the basis of rate as long 
as it was easy to do this and, conversely, 
that they judged on the basis of time as 
long as it was easy to do that. The hypoth- 
esis is suggested that at a certain point 
it is equally difficult for the observer to
judge on either basis. It is possible that 
at this point variability should be at its 
highest. This does not mean that the 
point of break in the 1/2 judgment func- 
tions marks the point above which the 
observers cannot discriminate differences 
in time and below which the observer 
cannot discriminate differences in rate. It
is the point at which it is no longer very 
easy for the observer to continue to dis- 
criminate either time or rate. 

There is still another possible ex- 
planation of the discontinuity. There is 
some change in brilliance as a function 
of rate. It is possible that the integrated 
intensity of the flashes begins to produce 
an effect on behavior in the neighbor- 
hood of 4.5 per sec. At the present time 
the author can do no more than present 
the possibility as he has no direct experi- 
mental evidence to either refute or sub- 
stantiate it. 


SECTION E. SUMMARY 


1) The problem was to construct an 
equal unit scale for visual rate (the per- 
ceived rate of the flash of a lamp) by 
the method of fractionation and to de- 


termine the relation of visual rate so 
scaled to the stimulus magnitude. 

2) The 1/2 judgment function showed 
marked evidences of discontinuity. The 
hypothesis that the 1/2 judgment func- 
tion for visual rate is discontinuous is 
supported by a curve fitting technique, 
by the fact that all the observers show 
the break, by the break in the relative 
variability functions and by the intro- 
spections of the observers. 

3) The discontinuity is attributed to 
the possibility that the observers may 
have been judging time at the slower 
rates and “speed” or rate at the faster 
rates. 

4) A solution was offered for the 
theoretical and practical difficulties in- 
herent in the construction of magnitude 
functions for discontinuous data.

5) The extremely small variability of 
the 1/2 judgments and the consistency 
between the observers in respect of the 
shapes of the 1/2 judgment curves lead 
to the belief that the results are reliable. 

6) An equal unit scale which would 
meet the criteria for ordinal scales and 
for equal units has been successfully 
constructed for visual rate. 











Part IV 


EXPERIMENTAL: THE SCALING OF SUBJECTIVE DIFFICULTY OF DIGIT SERIES


SECTION A. THE PROBLEM 


THE PROBLEM is to discover whether
an equal unit scale can be constructed
for the subjective difficulty of
memorizing and recalling digit series. If 
a scale for subjective difficulty for digit 
series can be constructed, it will be pos- 
sible to determine the relation between 
the subjective difficulty and the number 
of digits in the series. The scale could 
also be compared to the objectively de- 
termined incorrectness of the repro- 
duced digits and to the subjectively 
determined incorrectness of the reproduced
digits. The objectively determined
incorrectness is defined as the percentage
of the series of any given length that is
reproduced incorrectly and the subjec- 
tive incorrectness is defined as the per- 
centage of the series of any given length 
that the observer thinks he reproduces 
incorrectly. 


SECTION B. DISCUSSION OF THE PROBLEM 
1) Digit Span 

The digit span has been used as a test 
of memory for some time. It was in- 
cluded in the Binet-Simon tests (2). Per- 
haps digit span finds its greatest practical 
use in present day psychology in various
well known intelligence tests. In the
Stanford Revision of the Binet-Simon
Tests (45) a series of 3 digits is placed at 
the three year level; four digits at year 
four; five digits at year seven; six digits 
at year ten; seven digits at year four- 
teen; and eight digits at the superior 
adult level. 

In the Revised Stanford-Binet Scale 
(46) two digits are placed at two years
and six months; three digits at three 
years; four digits at four years and six 
months; five digits at seven years; six 
digits at ten years; eight digits at 
Superior Adult II and nine digits at 
Superior Adult III. 

The practice in the Stanford-Binet
Tests is to count the test as passed if one
out of three of the digit series at any
given level is correct.

Kuhlmann’s Tests of Mental Development
include a digit series of two at two
years two months and one of five at five
years one month. The directions are not 
the same for the two year-levels. 

There have been several determina- 
tions of auditory memory span for 
digits at the adult level. Carothers (8) 
found an auditory span of 7.53 ± 0.53
for a group of women college students 
(N = 200). Garrett (18), using a group 
of 158 male college students, obtained 
a span of 8.4. Martin and Fernberger (30) 
report that college students are able to 
increase their digit span about 20 per 
cent with practice. 

The method of constant stimuli provides
a more accurate technique for obtaining
the memory span for digits than
that of mental test technique. The digit 
span obtained by the mental test tech- 
nique does not have the same meaning 
as the digit span obtained by the method 
of constant stimuli. For example, on the 
Stanford-Binet the test is passed if the 
observer reproduces one series correctly 
out of three. All that can be said of the 
result is that the observer was at the 
time of examination at least capable of 
reproducing one series of a certain 
length, when given three trials. When
the span is computed by the method of 
constant stimuli a large number of 
series of different length is given to the 
observer and the percentage of correct 
(or incorrect) reproductions is plotted 
against the number of digits in the 
series. The median correct series and 
the P.E. can be obtained graphically; 
and the M and σ can be obtained by any
one of several methods (Woodworth 52,
p. 402 ff.). 
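
The graphical determination just described can be sketched numerically. The example below is only an illustration under stated assumptions: the percentages are hypothetical, the 50% point is found by straight-line interpolation between neighboring series lengths instead of by a fitted curve, and the P.E. is taken as half the distance between the 25% and 75% points. The function names are invented for the sketch.

```python
def crossing(lengths, pct_correct, level):
    """Series length at which the percentage correct falls to `level`.

    Uses straight-line interpolation between adjacent plotted points.
    """
    points = list(zip(lengths, pct_correct))
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        if y0 >= level >= y1 and y0 != y1:
            return x0 + (y0 - level) * (x1 - x0) / (y0 - y1)
    raise ValueError("level is not bracketed by the data")


def span_and_pe(lengths, pct_correct):
    """Median span (50% point) and P.E. (half the 25%-75% distance)."""
    median = crossing(lengths, pct_correct, 50.0)
    upper = crossing(lengths, pct_correct, 25.0)  # longer series, fewer correct
    lower = crossing(lengths, pct_correct, 75.0)  # shorter series, more correct
    return median, (upper - lower) / 2.0


# Hypothetical data: percentage of series reproduced correctly at each length.
lengths = [5, 6, 7, 8, 9, 10, 11]
pct_correct = [100.0, 95.0, 80.0, 55.0, 30.0, 10.0, 0.0]

median, pe = span_and_pe(lengths, pct_correct)
print(median, pe)
```

With these invented percentages the 50% point falls between 8 and 9 digits, which is the sense in which the method of constant stimuli yields a fractional span.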


2) Subjective Difficulty 


The usual method of measuring the 
difficulty of a mental test item is in
terms of the frequency of correct re- 
sponse. A difficult item is defined as an 
item that is passed by relatively few 
persons. An easy item is defined as one 
that is passed by a relatively large num- 
ber of persons. The results of the meas- 
urement of difficulty defined in this 
manner are expressed in percentages or 
T units or σ scores or other measures depending
upon the frequency of passing
or failing the item. It has been suggested 
(51) that “difficulty” is a poor name for 
this phenomenon. For example, “We 
may, then, conclude that the concept of 
difficulty (in its objective aspects) should 
for purposes of precise statement be re- 
named: and we suggest some such phrase 
as ‘incidence of successful performance’ 
(under specified conditions) as relatively 
free from the possibility of misinterpre- 
tation through assumption of more or 
less magical properties to units of the 
task itself... . .” Difficulty defined in 
terms of frequency of failure will be 
referred to hereafter as objective 
difficulty.

There is, of course, another kind of
difficulty which is part of the experience 
of the observer. If an observer is asked 
if an item is difficult he is able to give a
rather definite reply. If asked to explain 
the basis of his judgment he may give a 
variety of material. A list of some of 
these introspections follows; the list is 
based partly on the introspections of the 
observers in this and the next experi- 
ment: 

1) Estimation of how difficult the task 
would be for a large number of people 
(“difficult” being defined objectively). 

2) Estimate of the correctness of the 
observer’s own answer. 

3) Confidence in final answer after the 
task is completed. 

4) Confidence that a correct reply will 
be obtained during the course of the 
reproduction.

5) Lack of familiarity with the type of 
task (the task may be subjectively difficult 
because the observer has never had any 
experience with the material). 

6) Length of time to solve the prob- 
lem. 

7) Complication of the problem (i.e., 
the problem may be long and intricate
but objectively easy). 

8) Feelings of strain and effort. 

9) Feelings of indecision. 

It can be seen that many of the above
criteria for subjective difficulty overlap. 
It can also be seen that some of them are 
based directly on an estimate of the 
correctness or incorrectness of the ob- 
servers’ final reply. The experimenter 
did not wish to define subjective difficulty
on the basis of the observers’ direct
estimate of the correctness of the final 
answer. He wished to scale the difficulty 
of doing the item. In other words the 
observers were asked to use as a criterion 
the “difficulty” they experienced in 
reaching a solution no matter whether 
they thought that that solution was cor- 
rect or not. 

Beyond this and a few cautions against 
false criteria no definition of subjective 
difficulty was attempted. The experimenter
did not wish to adopt a rigid
definition of difficulty and then warn 
the observers not to judge on the basis 
of this or that criterion. He felt that if 
this was done there might be no criteria 
left on which the observers could base 
their judgments. Actually the observers 
reported approximately the same criteria,
numbers 4, 6, 7, 8 and 9 above.
But these criteria were not equally 
effective for all the observers. An ob- 
server would report, for example, that he 
used 4, 6, 7 and 8 but that 6 was the 
most important determiner of the judg- 
ment. It is the opinion of the experi- 
menter that all of these so called “cri- 
teria” are really different discriminable 
characteristics that are closely related to 
each other. They are magnitudes of the 
same kind. When the observer is asked 
to make a judgment of “subjective difficulty”
he actually judges one or more of
these characteristics. Some of these have 
been excluded by the instructions not to 
use “correctness” as a basis for judgment. 
The relative importance of certain cri- 
teria will not only be a function of indi- 
vidual differences but also of the 
material. 

Farmer (12) found that there was more 
subjective difficulty for objectively diffi- 
cult tasks than for objectively easy tasks, 
even when the observer did not know 
the objective difficulty of the task. Hertz- 
man (25) found rank order correlations 
ranging from 0.50 to 0.86 between 
confidence ratings and the objective 
difficulty of memory items that were cor- 
rectly matched. Hertzman defines sub- 
jective difficulty by “confidence” (p. 114). 


SECTION C. PROCEDURE 


1) The observers were given a sheet of 
paper made up as follows: 


                                                   Errors in
      Standard      Variable      Judgment         repro.
1                                 G   1/2   L      ____


There were enough of these spaces 
for all the digit series given in a single 
experimental session. The observer was 
given a copy of the following instruc- 
tions, which he was asked to read while 
the experimenter read them to him: 





I am going to read a series of digits to 
you and when I have finished, I want you 
to write it from memory in this space (space 
for standard). I am then going to read an- 
other series to you and when I have finished, 
I want you to write it in this space (space 
for variable). Then I want you to judge 
whether the second series of digits was half 
as hard for you to remember as the first, or 
whether it was more or less than half as 
hard. If the second series seemed half as 
hard, circle 1/2, if it seemed more than half
as hard as the first, circle G, and if the second 
series seemed less than half as hard as the 
first, circle L. When you have finished mak- 
ing this judgment of the relative difficulty of 
the second series, will you put an X in this 
column if you think you have made an error 
in reproducing the first series; and put an X 
in this column if you think you have made 
an error in reproducing the second series. 

Don't worry about mistakes or what you 
think are mistakes as I shall give you quite 
a few series of digits that are too long for 
you to remember. First, try as hard as you 
can to learn each series, then make the 
judgment of difficulty and after that register 
your impression as to whether each of the 
series were reproduced correctly. Do each 
thing in the order in which I have given it. 

N.B. You are to judge whether the second 
series is half as difficult to learn as the first, 
or whether it is more than half as difficult 
or less than half as difficult, NOT whether 
the second series is as difficult as the first or 
more difficult or less difficult. 

To look at it another way, the nearer the 
second series approaches the first in difficulty 
the more likely it is to be more than half 
as hard, and the easier the second series is in 
reference to the first the more likely it is to 
be less than half as hard. 

An example of this type of judgment
was given to the observers using the 
length of lines. The observers were then 
warned against: 1) judging the second 
series on the basis of the number of 
digits it contains; 2) attempting to be 
consistent for the sake of being con- 
sistent; 3) using their estimate of the 
correctness of the digit series as the basis 
of the judgment. 

When the instructions had been read 
by the observer he was encouraged to 
ask as many questions as he wished. The
same principle was adopted in this ex- 
periment that was adopted in the experi- 
ment on visual rate: it is more important 
that the observers understand the in- 
structions than that they be given the 
same instructions word for word. 

The observers were then given 39 
groups of standard and variable. This 
number of judgments constituted one 
experimental session. There were twenty 
experimental sessions for each observer. 
Each observer made, therefore, 780 
judgments and memorized or tried to 
memorize 1,560 digit series. 

The standards ranged in length from 
12 digits to 5 digits. During each experimental
session the observers judged 6, 7,
8, 9, 10 and 11 digits against a standard 
of 12; 5, 6, 7, 8, 9 and 10 digits against 
a standard of 11; 4, 5, 6, 7, 8 and 9 digits
against a standard of 10; 4, 5, 6, 7 and 8 
digits against a standard of 9; 3, 4, 5, 6 
and 7 against a standard of 8; 3, 4, 5 and 
6 against a standard of 7; 2, 3, 4 and 5 
against a standard of 6; and 2, 3 and 4 
against a standard of 5 digits. 
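
The arithmetic of this design can be checked by enumerating it. The sketch below simply transcribes the pairings listed above; the variable names are chosen for the illustration and are not the author's. It also counts how often each standard occurs in ten sessions, which is the basis of the second row of Table 3.

```python
# Variables judged against each standard in one session, as listed in the text.
VARIABLES_BY_STANDARD = {
    12: [6, 7, 8, 9, 10, 11],
    11: [5, 6, 7, 8, 9, 10],
    10: [4, 5, 6, 7, 8, 9],
    9: [4, 5, 6, 7, 8],
    8: [3, 4, 5, 6, 7],
    7: [3, 4, 5, 6],
    6: [2, 3, 4, 5],
    5: [2, 3, 4],
}
SESSIONS = 20

# One judgment per standard-variable pair.
pairs_per_session = sum(len(v) for v in VARIABLES_BY_STANDARD.values())
judgments_per_observer = pairs_per_session * SESSIONS
# Each judgment involves two series: the standard and the variable.
series_per_observer = 2 * judgments_per_observer

# Number of times each standard is presented in the first ten sessions.
standards_in_ten_sessions = {
    standard: 10 * len(variables)
    for standard, variables in sorted(VARIABLES_BY_STANDARD.items())
}

print(pairs_per_session, judgments_per_observer, series_per_observer)
print(standards_in_ten_sessions)
```

The counts agree with the figures given in the text: 39 pairs per session, 780 judgments and 1,560 series per observer.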

All the standards of any one length 
were presented together, i.e., all the 
judgments against the standard of 12 
were completed before going on to the 
next standard. The variable series judged 
against a given standard were presented 
in a random order. The order in which
the groups of standards were presented
was randomized with respect to the daily 
sessions. 

The digits were presented at approxi- 
mately one per second in an even tone 
of voice. Rest periods were allowed the 
observers when they requested them. 

The digit series were constructed by 
the experimenter. No two numerals that 
are adjacent in the numeral series were 
ever adjacent in the digit series, and no 
regular progressions, such as 3, 5, 7, were 
used. 780 of such series were constructed. 
When they had been used once, which 
required 10 daily sessions, the 780 were 
repeated. 
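
The two rules of construction can be expressed as a small generator. This is a sketch and not the experimenter's procedure, which is not described beyond the two rules. Three assumptions are made here and are not in the text: the digits 1 to 9 are used, an immediate repetition of a digit is not allowed, and a "regular progression" means three successive digits with a constant difference.

```python
import random


def violates_rules(series):
    """True if the series breaks one of the construction rules."""
    # Rule 1: numerals adjacent in the numeral series may not be adjacent
    # in the digit series. (Assumption: immediate repeats are also barred.)
    for a, b in zip(series, series[1:]):
        if abs(a - b) <= 1:
            return True
    # Rule 2 (assumed reading): no three successive digits with a constant
    # difference, such as 3, 5, 7.
    for a, b, c in zip(series, series[1:], series[2:]):
        if b - a == c - b:
            return True
    return False


def make_series(length, rng):
    """Build a series digit by digit, choosing only among permitted digits."""
    series = []
    while len(series) < length:
        # Only the last two digits can conflict with a new digit.
        candidates = [d for d in range(1, 10)
                      if not violates_rules(series[-2:] + [d])]
        series.append(rng.choice(candidates))
    return series


rng = random.Random(0)  # fixed seed so the example is repeatable
example = make_series(12, rng)
print(example, violates_rules(example))
```

At most four digits are excluded at any step, so a permitted digit is always available and the construction cannot stall.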


SECTION D. RESULTS 


In the case of each standard the per- 
centages of greater-than-one-half judg- 
ments were plotted for each of the 
variables. Probability coordinates were 
used (24). The points were fitted with a 
curve by eye and the median 1/2 judg- 
ment was read directly from the curve. 
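
A numerical counterpart of this graphical procedure is sketched below. It is not the method used in the study: the curves were fitted by eye, whereas the sketch fits an unweighted least-squares straight line to the percentages after converting them to normal deviates, which is what plotting on probability coordinates amounts to. The percentages are hypothetical and the function name is invented for the illustration.

```python
from statistics import NormalDist


def median_half_judgment(variables, pct_greater):
    """Number of digits at which 50% of judgments are 'greater than 1/2'.

    Percentages are converted to normal deviates (z) and a straight line
    is fitted to the (digits, z) points by unweighted least squares.
    """
    usable = [(x, NormalDist().inv_cdf(p / 100.0))
              for x, p in zip(variables, pct_greater)
              if 0.0 < p < 100.0]  # 0% and 100% have no finite deviate
    if len(usable) < 2:
        raise ValueError("need at least two points strictly between 0% and 100%")
    n = len(usable)
    mean_x = sum(x for x, _ in usable) / n
    mean_z = sum(z for _, z in usable) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in usable)
    sxz = sum((x - mean_x) * (z - mean_z) for x, z in usable)
    slope = sxz / sxx
    # The median is where the fitted line crosses z = 0, i.e. 50%.
    return mean_x - mean_z / slope


# Hypothetical percentages of 'greater than 1/2' judgments against a
# standard of 12 digits.
variables = [6, 7, 8, 9, 10, 11]
pct_greater = [2.0, 10.0, 35.0, 65.0, 90.0, 98.0]

print(median_half_judgment(variables, pct_greater))
```

Because the invented percentages are symmetrical about 50%, the fitted median falls midway between 8 and 9 digits. Percentages of 0 or 100 are dropped, since they cannot be placed on probability coordinates.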

The data from one of the observers 
was fitted by three different methods: best
fitting straight line (by eye), best fitting
curve (by eye) and best fitting straight
line by the Müller-Urban weights.

The comparison of the methods con- 
vinced the experimenter that nothing 
was to be gained by using the M rather 
than the median and that there was 
little to choose between the rectilinear 
and the curvilinear fit. The chief differ- 
ence between the straight line and the 
curve lies at the ends of the distribution. 
The change in the median values was 
negligible. Furthermore, despite the fact 
that many of the plots on the probability
paper showed skewness, the differences
between the M and the median were not
very large. 


The median 1/2 judgments were
separately calculated for the first ten
sessions and the second ten sessions.
This was done in order to determine the 
effect of practice on the 1/2 judgments. 
In the present study only the data from 
the first ten sessions will be presented.


When the median 1/2 judgments had
been determined the percentage of incorrect
responses was calculated for
digit series of different length. These calculations
were made only for the standards
and only for the data from the first
ten sessions.

Then the percentage of subjectively
incorrect responses was calculated for
digit series of different length. These calculations
were also based only on the standards
for the first ten sessions.

Table 3 shows the median 1/2 judgments
for the different standards, the percentage
of the standards that was objectively
incorrect and the percentage of
the standards that was subjectively incorrect.
The table also presents the M
of the 1/2 judgments for all the observers.
The table also gives
the number of cases on which the objective
and subjective incorrectness was
based. (Since the standard of 12 occurred
in combination with series of 6, 7, 8, 9,
10 and 11 digits and the standard of 5
occurred in combination with series of
2, 3 and 4 digits, it is obvious that the
standard of 12 occurred more frequently
than the standard of 5.)


TABLE 3

A summary of the results of the experiment on the subjective
difficulty of memorizing and reproducing digit series

Number of digits in standard              5      6      7      8      9     10     11     12
Number on which % objectively and
  subjectively incorrect is based        30     40     40     50     50     60     60     60

Be.  Number judged 1/2                 2.68    [?]    [?]    [?]    [?]    [?]   8.25    [?]
     Per cent objectively incorrect       0      0    2.5    8.0    6.0   15.0   26.6   35.0
     Per cent subjectively incorrect      0    2.5   10.0   16.0   10.0   23.0   46.5   46.8

Co.  Number judged 1/2                 1.30    [?]    [?]   4.65    [?]    7.0    7.6   8.65
     Per cent objectively incorrect     3.0    7.5   17.5   36.0   56.0   71.5   90.0    [?]
     Per cent subjectively incorrect      0    5.0    [?]   28.0   37.0   33.4   48.0   45.0

Dr.  Number judged 1/2                  [?]    [?]    [?]    [?]    [?]   6.88   7.15   7.54
     Per cent objectively incorrect       0    [?]    [?]    [?]   32.0   63.0   81.5   86.5
     Per cent subjectively incorrect    [?]      0    5.0   16.0   44.0    [?]   80.0   98.5

Ka.  Number judged 1/2                  [?]    [?]    [?]    [?]    [?]   5.45   6.15    6.6
     Per cent objectively incorrect     3.3    2.5      0   16.0   22.0   45.0   63.0   77.0
     Per cent subjectively incorrect      0    2.5      0    4.0    4.0   25.0   40.0   54.0

Ge.  Number judged 1/2                  1.9    [?]    [?]    [?]    [?]    [?]    7.2    7.6
     Per cent objectively incorrect     6.0   20.0   65.0   88.0   96.0   98.5  100.0  100.0
     Per cent subjectively incorrect      0      0    2.5   10.0   18.0   21.8   52.0   55.0

Average judged 1/2                     1.94   2.83    [?]    [?]   5.67   6.43   7.26   7.98

In Figures 29-33 the median 1/2 judg- 
ments have been plotted against the 
standards for each observer separately
and the average 1/2 judgment function
for all the observers is shown in Figure
34. The points have been fitted with a
curve by eye. It will be noted that four
of the functions have the same shape.
They are approximately a straight line
over the lower section of the range and
become negatively accelerated near its
upper end. The fifth function, that for
Co., is best fitted by a straight line. The
similarity between the shapes of the 1/2
judgment functions is striking but even
more striking is the smoothness of the
plotted data. This is one of the best
arguments obtainable for the reliability
of a function.

FIGS. 29-33. Half judgment functions for
the subjective difficulty of digit series for five
observers (arithmetic coordinates). The coordinates
represent the actual number of
digits in the series.

From these curves the magnitude functions
have been constructed in the usual
fashion. In all of the magnitude func- 
tions the point of origin is taken to be 
four digits on the abscissa and four units 
of subjective magnitude on the ordinate. 
In other words, four units of subjective 
magnitude were arbitrarily assigned to 
represent the subjective difficulty of 
four digits. These magnitude functions 
are shown in Figures 35-39, and the 
average function in 40. 
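
The construction itself is described elsewhere in this study and is not repeated here. The sketch below is therefore only an assumed reading of it: a series judged half as difficult as a standard is given half the standard's magnitude, so that, starting from the arbitrary origin of four units at four digits, each step up the 1/2 judgment function doubles the magnitude. The 1/2 judgment values are hypothetical, straight-line interpolation stands in for the curve drawn by eye, and the function names are invented. Only the doubling points are found; the rest of the curve would still have to be drawn through them.

```python
def inverse_half(half_points, target):
    """Standard whose 1/2 judgment equals `target`, or None if out of range.

    `half_points` is a list of (standard, digits judged half) pairs in
    ascending order; interpolation between neighbors is linear.
    """
    for (s0, h0), (s1, h1) in zip(half_points, half_points[1:]):
        if h0 <= target <= h1 and h1 > h0:
            return s0 + (target - h0) * (s1 - s0) / (h1 - h0)
    return None


def doubling_chain(half_points, anchor_digits=4.0, anchor_units=4.0):
    """Points (digits, magnitude) reached by successive doubling from the origin."""
    chain = [(anchor_digits, anchor_units)]
    digits, units = anchor_digits, anchor_units
    while True:
        next_digits = inverse_half(half_points, digits)
        # Stop when the data run out (or fail to move up the scale).
        if next_digits is None or next_digits <= digits:
            return chain
        digits, units = next_digits, units * 2.0
        chain.append((digits, units))


# Hypothetical 1/2 judgment function: (digits in standard, digits judged half).
HALF_POINTS = [(5, 2.0), (6, 2.9), (7, 3.8), (8, 4.6),
               (9, 5.4), (10, 6.2), (11, 6.9), (12, 7.5)]

for digits, units in doubling_chain(HALF_POINTS):
    print(round(digits, 2), units)
```

With these invented values the chain stops after two doublings, because no standard within the range tested has a 1/2 judgment as large as the last point reached.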

In the same figures the percentage objectively
incorrect and the percentage
subjectively incorrect have been plotted
and a smooth curve has been drawn
through the points. The actual values 
of these percentages are not shown as 
plotted, but they may be found in 
Table 3. 


SECTION E. DISCUSSION OF RESULTS 


The first problem in the results con- 
cerns the negative acceleration of the 
upper end of the 1/2 judgment function 
for all the observers except Co. The in- 
terpretation of this deceleration is that 
after a certain point increases in the 
number of digits in the standard lead to 
less and less of an increase in the number
of digits in the variable. In other words 
the subjective difficulty of memorizing 
and reproducing the series is reaching a 
limit. All of the functions show that an 
increase in the number of digits, when 
the absolute number of digits in the
series is small, adds less subjective difficulty
than an equal increase at the middle
of the range. This is amply supported
by the introspections of the observers.
All of the observers except Co. reported 
that after a certain point was reached it 
was hard to tell the difference between 
one series and another. For example 11 
digits were about as difficult as 12. This 
is certainly reasonable. If an observer
has a span of 8 digits and never reproduced
12 correctly, it would seem that
the addition of further digits is not going 
to add much to the subjective difficulty 
of the task. The calculus and high school 
algebra have equal subjective difficulty 
for the I.Q. of 50. 

The one exception to this is observer
Co., who reported that the addition of a
digit at 11 digits was noticeable. It will
be noted that the 1/2 judgment function
for Co. does not show the negative
acceleration shown by the other observers.
The experimenter believes that
Co.’s 1/2 judgment function would have
the same shape if longer digit series had
been given.

FIG. 34. The half judgment function for the
subjective difficulty of digit series for five observers
averaged (arithmetic coordinates). The
coordinates represent the actual number of digits
in the series. (Abscissa: number of digits in standard;
ordinate: number of digits judged half.)

The next question concerns the establishment
of zero subjective difficulty. If
the magnitude functions for Co., Be., Ka.
and Ge. were extrapolated to the ordinate,
it can be readily seen that the
result would be to assign a certain
amount of subjective difficulty to a series
of zero digits. In other words memorizing
and producing no digits at all would
have a certain difficulty for these observers.
The obvious extrapolation of
Dr.’s magnitude function would be to
the abscissa in the neighborhood of 3
digits. This would be interpreted to mean that a
series of 3 digits had zero subjective magnitude
for this observer.

FIGS. 35-39. The magnitude functions
for the subjective difficulty of
digit series for five observers. The
functions for the per cent objectively
incorrect and the subjectively incorrect
are also shown. The abscissa represents
the number of digits in a given
series, and the ordinate is subjective
difficulty (left hand side) and percentage
incorrect (failed) (right hand side).
The magnitude function is labelled
MAG and the subjectively incorrect
and the objectively incorrect are labelled
SUB and OBJ respectively. The
coordinates are arithmetic.

But all of the observers report either 
that 2 or 3 digits have no subjective 
difficulty or that their subjective diffi- 
culty is extremely small. All of the ob- 
servers were agreed that 1 digit had no 
subjective difficulty at all. Yet the 
“obvious” extrapolation of the magni- 
tude functions of observers Be., Ge., Ka. 
and Co. is to the ordinate. 

FIG. 40. The magnitude function for the subjective
difficulty of digit series for the average
of five observers (arithmetic coordinates). This
function was constructed from Figure 34. The
abscissa represents the number of digits in the
series and the ordinate represents units of subjective
difficulty.

The answer to this problem is fairly
obvious. First, in order to obtain the
median 1/2 judgment for any standard
it is necessary to be able to plot at least
two points on the probability coordinates.
These two points must be greater
than 0% and less than 100%. Suppose
that two points are obtained; if they lie
far from 50% and on the same side of
50% an accurate determination of the
median by extrapolation is difficult and
unreliable. Second, the observers reach a
limit in subjective difficulty at both ends
of the curve. At the upper limit, at
maximum subjective difficulty, it is
always possible to present the observer
with variables that can be judged both
greater and less than 1/2 as difficult. In
other words it is always possible to obtain
several points on both sides of the 50%
point on the probability paper. However
at the lower end of the range, at minimum
subjective difficulty, it is not possible
to do this. A point is reached in the
neighborhood of 4 or 5 digits where the
variables would always be judged as less
than 1/2 as difficult. If the variable is
never judged to be greater than 1/2 as
difficult as the standard, or equal to 1/2
the difficulty of the standard, then the
percentage of greater judgments is 0.
No points can be obtained by which the
median 1/2 judgment can be found.
Furthermore, even if an occasional
greater or equal judgment is given for
two of the variables, the points will lie
far from the 50% points on the probability
paper and extrapolation is either
impossible or extremely hazardous. This
means, then, that points near zero subjective
magnitude will be impossible to
obtain. In short, the true direction of
extrapolation of the magnitude function
plots is experimentally indeterminable
by this method, because the true shape of
the 1/2 judgment functions cannot be
determined at or near zero subjective
difficulty. However, reasoning from the
introspective evidence and from common
sense, it is readily seen that the “obvious”
extrapolation of the magnitude functions
to the ordinate is not correct. If
they are extrapolated at all, they should
be extrapolated to the abscissa. Since this
involves a sharp change in the slope of
the curve it cannot be done with any
degree of confidence.
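
The hazard of extrapolating from two points that lie far from 50% can be illustrated numerically. The sketch below uses hypothetical percentages and a straight line through two points on probability coordinates; it is an illustration of the argument and not a calculation made in the study, and the function name is invented.

```python
from statistics import NormalDist


def two_point_median(x0, p0, x1, p1):
    """50% point of the straight line through two points on probability
    coordinates (percentages converted to normal deviates)."""
    z0 = NormalDist().inv_cdf(p0 / 100.0)
    z1 = NormalDist().inv_cdf(p1 / 100.0)
    return x0 - z0 * (x1 - x0) / (z1 - z0)


# Two points that bracket 50%: the median is found by interpolation.
near = two_point_median(2, 40.0, 3, 60.0)

# Two points far below 50% on the same side: the median must be found by
# extrapolation, and one percentage point of change in a single
# observation moves the result by more than a whole digit.
far_a = two_point_median(2, 2.0, 3, 5.0)
far_b = two_point_median(2, 2.0, 3, 4.0)

print(near, far_a, far_b, abs(far_a - far_b))
```

When the points bracket 50% a small error in either percentage moves the median only slightly; when both lie near 0% the same error is magnified by the length of the extrapolation.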

FIG. 41. Three types of extrapolation of a half
judgment function (see text). 


In practice it would perhaps be better 
not to extrapolate the magnitude func- 
tion but the 1/2 judgment function on 
which it is based. Figures 41 and 42 show 
the results of extrapolation of the 1/2 
judgment function. Figure 41 represents 
a curve for some hypothetical data. The
curve has been extrapolated from the
point X in the directions represented by
A, B and C. Figure 42 shows the resulting
effect on the magnitude function. Extrapolation
of the 1/2 judgment function
in the direction A results in the magnitude
function A in Figure 42; likewise
extrapolation of the 1/2 judgment function
in the directions B and C results in
the magnitude functions B and C respectively.

It is clear, then, that the obtained data 
are inadequate for the determination of
zero subjective difficulty. However, one
thing seems certain from the introspective
evidence: zero subjective difficulty
is not associated with less than 1
digit.

An examination of the objectively and 
subjectively incorrect judgments leads to 
some rather interesting considerations. 
The order of the systems in the magnitude
functions and in the objectively
incorrect and subjectively incorrect functions
is the same. They are magnitudes
of the same kind. The common factor is 
obviously the number of digits in the 
series. 

Furthermore all of the objectively in- 
correct and subjectively incorrect func- 
tions have the same shape. (Be.’s objec- 
tively incorrect curve has not become 
negatively accelerated at 12. Be. has a 
digit span of about 14 digits, but it can 
be assumed with maximum confidence 
that he will eventually reach a series 
where he will fail more often than he 
succeeds!) 

The similarity between the shape of 
the magnitude functions and the other 
two functions cannot be readily ex- 
plained on the basis of some factor com- 
mon to all three. Of course there is a 
common factor, the actual number of 
digits, but this merely establishes the 
fact that the systems have the same 
order. The objectively incorrect and 
subjectively incorrect functions are typi- 
cal psychometric functions. The magni- 
tude function has an upper asymptote, 
but the deceleration at the top of the 
magnitude function is due, it has been 
said, to the approach of an upper limit 
of difficulty beyond which no increase in 
number of digits adds to the subjective 
difficulty of the task. 

Some other interesting facts can be 
gleaned from a study of the functions. 
The observer Dr. was extremely embarrassed
about serving as a subject.
Although his span is well above average, 
he thought that it was low and did not 
want the experimenter, who was a 
friend, to find out how low it was. He 
found the task extremely difficult and 
worked very hard to do well. All of 
these facts are reflected in the functions. 
His magnitude function rises very steep- 
ly. He finds the task of memorizing the 
digits extremely difficult but because of 
his superhuman labor he became con- 
fident about the shorter digits. His con- 
fidence is greater than his performance 
for the lower regions of the curve. Only 
at the upper end of the curve did his 
feelings of inferiority overcome his effort. 
He was sure that he was not getting any 
of the series correct, when in actuality 
he was. 

Three of the five observers were more 
confident than their objective perform- 
ance warranted. 

A mere examination of the objective 
and subjective digit spans of these ob- 
servers would not give all the informa- 
tion obtained by actually plotting out 
the functions. A good example of this is 
found in Dr.’s functions discussed above. 
The crossing of the subjective and ob- 
jective functions at about 10.3 digits is, 
as was seen in the discussion, one of the 
interesting facts in this observer's per- 
formance. Likewise the increasing spread 
between the objective and subjective 
functions of Ka. would not be seen by 
merely comparing two points on the 
curve. Ka. was more confident than her 
performance warranted and the con- 
fidence seemed to be not too greatly 
affected by objective incorrectness.
Throughout the experimental sessions
she seemed to be quite self-assured. 
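
The increasing spread in Ka.'s functions can also be read from the tabled percentages. The sketch below takes the values for Ka. as they are read from Table 3 and subtracts the subjectively incorrect from the objectively incorrect percentage at each standard. These are the unsmoothed percentages, whereas the discussion above refers to the smooth curves drawn through them.

```python
# Percentages for observer Ka., as read from Table 3.
STANDARDS = [5, 6, 7, 8, 9, 10, 11, 12]
KA_OBJECTIVE = [3.3, 2.5, 0.0, 16.0, 22.0, 45.0, 63.0, 77.0]
KA_SUBJECTIVE = [0.0, 2.5, 0.0, 4.0, 4.0, 25.0, 40.0, 54.0]

# Positive spread: more series were wrong than the observer believed,
# i.e. confidence exceeded performance.
spread = [round(obj - sub, 1) for obj, sub in zip(KA_OBJECTIVE, KA_SUBJECTIVE)]

for digits, value in zip(STANDARDS, spread):
    print(digits, value)
```

From 8 digits upward the difference grows from 12 to 23 percentage points, which a comparison of two single points on the curves would not show.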


SECTION F. SUMMARY AND CONCLUSIONS 


1) The problem was to test the possi- 
bility of constructing an equal unit scale 
for the subjective difficulty of memoriz- 
ing and reproducing digit series. 

FIG. 42. The three types of magnitude function
resulting from the extrapolations shown in
Figure 41 (see text).


2) The scales were successfully con- 
structed for each of five observers. 

3) The 1/2 judgment functions are the 
same shape for 4 of the 5 observers and 
the points for any one observer formed 
a smooth, continuous function. For this 
reason the experimenter considers the 
results reliable. 

4) The deceleration of the magnitude 
functions was due to the approach of an 
upper limit of subjective difficulty. 

5) The introspections of the observers 
indicated that there were many complex 
determinants of subjective difficulty. 

6) The analysis of the subjectively in- 
correct and objectively incorrect func- 
tions showed some individual character- 
istics paralleled by the observers’ be- 
havior during the experiment. 

7) The fact that this impalpable characteristic
has been successfully scaled by
an equal unit scale indicates that this 
type of material is open to empirical in- 
vestigation by other than purely statisti- 
cal techniques. 








Part V 


EXPERIMENTAL: THE SCALING OF THE SUBJECTIVE DIFFICULTY OF WORDS IN A
MULTIPLE CHOICE VOCABULARY TEST 


SECTION A. THE PROBLEM 


THE PROBLEM is to construct an equal
unit scale of subjective difficulty for
a multiple choice vocabulary test. If such 
a scale can be constructed, it will be 
possible to determine the relation of 
subjective difficulty to objective difficulty. 


SECTION B. DISCUSSION OF THE PROBLEM


It will be remembered that the sub- 
jective difficulty of a vocabulary test was 
cited as an example of a characteristic 
that either has no stimulus correlate at 
all or that has one that is exceedingly
complex. The scaling of such a charac- 
teristic was discussed in Part I. The con- 
clusion drawn from that discussion was 
that no other measurable magnitude is 
necessary for the construction of a sub- 
jective scale. All that is necessary is to 
have identifiable systems. 

There are several ways in which vo- 
cabulary items may be identified; the 
first and most obvious is by the word 
itself, since the words to be defined in 
different items are different; second, if 
the words are the same but have different 
multiple choice words, the item can be 
identified by the multiple choice words; 
third, it can be identified by its objec- 
tive difficulty. If the items are identified 
by the third means all items with the 
same objective difficulty are in principle 
indistinguishable, i.e., they are indistin- 
guishable so far as objective difficulty is 
concerned. The first necessity in con- 
structing the equal unit scale is to estab- 
lish an ordinal scale. This was not done 
either in the case of speed or in the case
of the digit series; from previous knowl- 
edge and experience there is maximum 
certainty that subjective rate increases as 
a function of physical rate and that the 
subjective difficulty of memorizing and 
producing the digit series increases as a 
function of the number of the digits in 
the series. In short, the order is the same. 
The magnitude functions prove that this 
is true; if it were not true the function 
would either change slope and descend 
at some point or there would be reversals 
in the percentage judged greater than 
1/2 in the raw data. 

In the present case it was assumed that 
subjective difficulty would be positively 
correlated with percentage of the total 
population passing the item, so no 
ordinal scale was constructed. This as-
sumption will be discussed later. As in
the case of visual rate and the subjective 
difficulty of the digit series a ready made 
ordinal scale was at hand. 

The vocabulary items were obtained 
from the I.E.R. Inventory of intellectual 
tasks and their difficulty (53). These 
items have been scaled in objective diffi- 
culty by a statistical scaling technique 
described by Thorndike (47). The units 
in which the items are scaled are sup- 
posedly “equal” and supposedly meas- 
ured from an “absolute zero.” In Part I 
there was a discussion of the “equality” 
obtained by the statistical techniques and 
no more need be said of it here. In 
plotting subjective difficulty against 
Thorndike’s units, the question arises, 
how should Thorndike’s units be spaced 
on the abscissa? There is no rule for
spacing his units unless his scale is con- 
sidered as a B-magnitude with a postu- 
lated relation between “difficulty” and 
the units. This matter was also discussed 
in Part I. 

The difficulty of the items in Thorn- 
dike’s units ranged from 220 to 442 or, in 
other words, if one accepts Thorndike’s 
units as equal and his zero as absolute, 
the most difficult item was about “twice” 
as difficult as the easiest. These units, 
based on percentage passing, are extreme- 
ly fine. There seemed to be no detectable 
change in subjective difficulty over a 
five point range. So to simplify the col- 
lection of the material for the experi- 
ment, the total range of Thorndike’s 
units was divided into 43 new large units, 
each unit consisting of a range of ap- 
proximately 5 of Thorndike’s units. The 
new unit 1 contained Thorndike’s words 
of difficulty 220-224; 2 contained 225-229; 
3 contained 230-234; 4 contained 235-239, 
etc. The chief exception to this was the 
new unit number 43 which contained 
431-442. Several of the new units con- 
tained only four of Thorndike’s units. 
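The grouping rule just described can be sketched as follows. This is a minimal illustration, assuming a nominal width of five Thorndike units per new unit and an open-ended top unit. The function name new_unit and the constant names are hypothetical, and the text notes that the actual assignment departed from the nominal rule in places (several new units held only four Thorndike units, and new unit 43 began at 431).

```python
LOWEST_SCORE = 220   # easiest item, in Thorndike's units (from the text)
HIGHEST_SCORE = 442  # most difficult item (from the text)
NOMINAL_WIDTH = 5    # approximate width of one new unit (from the text)
TOP_UNIT = 43        # number of new units (from the text)


def new_unit(thorndike_score: int) -> int:
    """Map a Thorndike difficulty value to a new unit under the nominal rule.

    Assumption: every unit below the top one spans exactly five Thorndike
    units; the top unit absorbs everything that is left over.
    """
    if not LOWEST_SCORE <= thorndike_score <= HIGHEST_SCORE:
        raise ValueError("score lies outside the range of the items used")
    unit = (thorndike_score - LOWEST_SCORE) // NOMINAL_WIDTH + 1
    return min(unit, TOP_UNIT)
```

Under this rule new_unit(220) and new_unit(224) both give 1, new_unit(235) gives 4, and every value from 431 to 442 falls in unit 43, which agrees with the examples given in the text.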

The vocabulary items of 220-300 of 
Thorndike’s units are multiple choice 
picture vocabulary items. These items 
are numbers 1 to 16 in the new units. 
Values in the succeeding discussion will 
be stated in the new units. A summary 
of the criticisms leveled against the scal- 
ing of this test by Thorndike will be 
found in Gansl (17). Despite the faults 
of the scale it is highly probable that 
what errors there are would not be of 
sufficient magnitude to affect the relation 
between the percentage passing and sub- 
jective difficulty. 

It is now necessary to return to the 
discussion of the identification marks of 
the systems. In the present case the 
identification marks are the new units 


1-43. (Within any one of these units the 
separate tests are, of course, “identified” 
by the word to be defined.) These units 
will also be used on the abscissa of the 
magnitude function. An objection may 
be raised that these units are not equal 
in the sense that Thorndike’s original 
units were equal because they do not 
contain an equal number of Thorndike’s 
units. The answer is, first, most of them 
do contain an equal number of Thorn- 
dike’s units; second, the error introduced 
is certainly no larger than that neces- 
sarily introduced by the random selec- 
tion of the items within any one of the 
units. The one exception is new unit 43 
which contains ten of Thorndike’s units. 
To compensate for this difference 43 has 
been plotted twice the usual distance 
from the unit next below it. 


SECTION C. PROCEDURE 


The procedure was similar to that 
used in the determination of the median 
1/2 judgment for the digit series. There 
were ten standards and each standard 
was compared with a series of variables. 
The task of the observer was to give a 
judgment of “greater than 1/2 as diffi-
cult,” “1/2 as difficult” or “less than 1/2
as difficult.” At some levels there were 
not enough items to pair the standard 
with more than one of each of the values 
of the variable. Table 4 gives the values 
of the standards and the variables with 
which they were compared together with 
the number of comparisons for each of 
the values of the variable. 

The items were printed singly on 
separate sheets of paper and presented in 
pairs to the observer. He was instructed 
to do the first item (the standard) and 
then to do the second item (the variable) 
and make his judgment. The instruc- 
tions were similar to the instructions for 
the digit series; the only real difference 
was due to the nature of the material.
The pairs of words were presented in a
random order to the observers. The pair-
ing of standard and variable for any one
comparison was randomized, i.e., the
comparison of 43 and 40 did not mean
that the identical items were paired for
every observer but only that an item with
the value 43 was compared with an item
of 40.
There were fifteen observers.


TABLE 4

Showing the values of the standards, the vari-
ables with which they were compared and the
number of judgments made for each com-
parison by each observer

Standard   Variable   N
   43         40      1
   43         34      1
   43         31      1
   43         28      1
   43         25      1
   42         39      1
   42         36      1
   42         33      1
   42         27      1
   42         24      1
   41         38      1
   41         35      1
   41         32      1
   41         26      1
   41         23      1
   39         37      1
   39         32      1
   39         29      1
   39         25      1
   39         22      1
   37         35      2
   37         31      2
   37         28      2
   37         25      2
   37         22      2
   34         32      3
   34         29      3
   34         26      3
   34         22      3
   34         16      3
   31         28      3
   31         25      3
   31         22      3
   31         12      3
   25         22      2
   25         16      2
   25          8      2
   25          5      2
   15         13      1
   15          9      1
   15          6      1
   15          3      1
    6          4      1
    6          3      1
    6          2      1
    6          1      1


SECTION D. RESULTS 


The number of “greater than 1/2”
judgments made by each of the observers
for each of the variables for any given 
standard was added. The sums were con- 
verted into percentages and plotted on 
probability paper, and the median 1/2 
judgment obtained in the way outlined 
for the digit series. 
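A numerical stand-in for the probability-paper step may make the computation concrete. The sketch below assumes that plotting on probability paper amounts to converting each percentage to a normal deviate, and that a straight line is then fitted by least squares (the text does not say how the line was drawn). The median 1/2 judgment is taken as the value of the variable at which the line crosses 50 per cent. The function name median_half_judgment is hypothetical.

```python
from statistics import NormalDist


def median_half_judgment(variables, proportions):
    """Estimate the variable value judged "greater than 1/2" half the time.

    variables   -- values of the variable (new units)
    proportions -- fraction of "greater than 1/2" judgments at each value
    """
    unit_normal = NormalDist()
    # Proportions of exactly 0 or 1 have no finite normal deviate, so they
    # are left out, much as they fall off the edge of probability paper.
    points = [(x, unit_normal.inv_cdf(p))
              for x, p in zip(variables, proportions) if 0.0 < p < 1.0]
    if len(points) < 2:
        raise ValueError("need two or more proportions strictly between 0 and 1")

    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_z = sum(z for _, z in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxz = sum((x - mean_x) * (z - mean_z) for x, z in points)
    if sxx == 0 or sxz == 0:
        raise ValueError("the points do not determine a sloping line")

    slope = sxz / sxx
    intercept = mean_z - slope * mean_x
    # A normal deviate of zero corresponds to 50 per cent.
    return -intercept / slope
```

The proportions passed in would be the summed judgments for one standard divided by the number of judgments made at each value of the variable.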

Table 5 shows the median 1/2 judg- 
ment for all the observers together for 
each of the standards. 

The data have been plotted and a 
curve fitted to the points by eye (Fig. 
43). From this curve a magnitude func-
tion has been plotted in the usual man-
ner. The origin is at 10 objective units
on the abscissa and 10 subjective units 
on the ordinate (Fig. 44). 
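The construction is set out in an earlier part of the study and is not repeated here, so the following is only one reading of "the usual manner," offered as an assumption: if an item at objective value h is judged 1/2 as difficult as a standard at objective value s, the subjective magnitude at s is taken to be twice that at h. Starting from the stated origin, the half judgment function is then stepped through repeatedly. The sketch uses the medians of Table 5 with straight-line interpolation in place of the curve fitted by eye, and it keeps only the rising part of the function (standards 31 to 39). All names are hypothetical.

```python
# Median 1/2 judgments from Table 5, rising part of the curve only.
STANDARDS = [31.0, 34.0, 37.0, 39.0]
HALVES = [10.0, 14.0, 22.8, 35.0]


def standard_for_half(target):
    """Return the standard whose 1/2 judgment equals target, or None.

    Uses straight-line interpolation between tabulated points; None means
    the target lies outside the tabulated range.
    """
    if target < HALVES[0] or target > HALVES[-1]:
        return None
    for i in range(len(HALVES) - 1):
        h0, h1 = HALVES[i], HALVES[i + 1]
        if h0 <= target <= h1:
            s0, s1 = STANDARDS[i], STANDARDS[i + 1]
            return s0 + (s1 - s0) * (target - h0) / (h1 - h0)
    return None


def magnitude_points(origin_x=10.0, origin_psi=10.0):
    """Step upward from the origin, doubling subjective magnitude each step."""
    points = [(origin_x, origin_psi)]
    x, psi = origin_x, origin_psi
    while True:
        x = standard_for_half(x)
        if x is None:
            break
        psi *= 2
        points.append((x, psi))
    return points
```

With these inputs the procedure yields only three points before it runs off the tabulated range, at objective values of 10, 31 and about 38.3, which illustrates how a steep 1/2 judgment function leaves few points for the magnitude function.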




SECTION E. DISCUSSION OF RESULTS 


The first thing to note about the 
curve in Figure 43 is the reversal at its 
upper end. It will be remembered that 
it was assumed that subjective difficulty 
would be correlated positively with the 
percentage of the general population 
passing the items. This is true for most 
of the range of difficulty but ceases to 
be true at the upper end of the curve. 
In other words, the assumption that the 
two functions have the same order was
not correct for the entire range. There is
some introspective evidence that might 
explain this. The upper levels of diffi-
culty contain multiple choice words
which have a certain confusion-value
for the observer. The word to be de-
fined is so difficult that the observer
seizes on the most likely (but often in-
correct) word. He bases his choice on the
apparent similarity of the stems or on
some other false criterion. But the net
effect of this is not to make the task sub-
jectively more difficult. The reversal may
not be completely explained on this basis,
but it is at least partially explained.

TABLE 5

Showing the median 1/2 judgment for each of the standards

Standard               43    42    41    39    37    34    31
Median 1/2 judgment   34.8  36.5  35.0  35.0  22.8  14.0  10.0

The assumption that the subjective 
difficulty of the items and the objective 
difficulty would have the same order 
seemed a safe assumption. The results 
of Farmer (12), of Hertzman (25) and 
of the experimenter’s own digit span 
experiment had led him to believe that 
the assumption was probably valid. 

A more correct procedure would be 
first to order the systems empirically; 
second, to group together all those sys- 
tems that were judged to be subjectively 
equal; third, to scale these groups of sys-
tems in a manner similar to that used
in the present experiment.

Fig. 43. The half judgment function for the
subjective difficulty of vocabulary test items
(arithmetic coordinates). The coordinates repre-
sent objective difficulty as defined in the text.
(Abscissa: standard items.)

Fig. 44. The magnitude function for the sub-
jective difficulty of vocabulary test items (arith-
metic coordinates). The abscissa represents ob-
jective difficulty and the ordinate represents
subjective difficulty. The circles are explained in
the text.

One technical difficulty has arisen in
the construction of the magnitude func-
tion. The 1/2 judgment function has a
very steep slope which means that few
points can be obtained for the construc-
tion of the magnitude function. The
result is that it is difficult to fit a curve
to points of the magnitude function.
The only way that this can be done is by
trial and error, checking the curve for
internal consistency as described in the
discussion of discontinuous functions.
The obtained magnitude function was
checked by the method of equal appear-
ing intervals.* The procedure was simi-
lar to that adopted for the main part of
the experiment. In this case, however,
the observers were given three items and
were told to do all of the items and then
to judge whether the interval of subjec-
tive difficulty between the first and sec-
ond item was “greater than,” “equal to”
or “less than” the interval in difficulty
between the second and third items. The
results were handled in the same manner
as in the main experiment. Two inter-
vals were used: 43-34 and 37-25. The
interval 43-41.6 was judged to be equal
to 41.6-34 and the interval 37-35.8 was
judged to be equal to the interval 35.8-
25.

* This experiment was carried out by W. R.
Brunner, Columbia College.

These points were plotted on the mag-
nitude function (Fig. 44) in the manner
described by Stevens and Volkmann 
(42). The limiting points of the interval 
were made to coincide with the mag- 
nitude function. The check is to see 
whether the bisecting point lies on the 
magnitude function. If it does the 
methods check. The bisection predicted 
by the magnitude function for the 43-34
interval is 39.8 whereas the obtained bi-
section is 41.6; the predicted bisection of
the 37-25 interval is 34 and the obtained
bisection is 35.8. The agreement is not 
perfect but the error is not great. 
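The logic of this check can be put in a short sketch. It assumes the magnitude function is available as a table of increasing points and that values between points are read off by straight-line interpolation, whereas the text reads them from a fitted curve. The function names are hypothetical. The bisections stored in REPORTED are the figures given in the text for the 43-34 and 37-25 intervals; no magnitude function values are invented here.

```python
def interpolate(xs, ys, x):
    """Piecewise-linear value of y at x; xs must be strictly increasing."""
    for i in range(len(xs) - 1):
        if xs[i] <= x <= xs[i + 1]:
            t = (x - xs[i]) / (xs[i + 1] - xs[i])
            return ys[i] + t * (ys[i + 1] - ys[i])
    raise ValueError("value lies outside the tabulated range")


def predicted_bisection(objective, subjective, lower, upper):
    """Objective value whose subjective magnitude is midway between the limits.

    Both lists must be strictly increasing so the function can be inverted.
    """
    midpoint = (interpolate(objective, subjective, lower)
                + interpolate(objective, subjective, upper)) / 2
    return interpolate(subjective, objective, midpoint)


# (predicted, obtained) bisections as reported in the text.
REPORTED = {"43-34": (39.8, 41.6), "37-25": (34.0, 35.8)}

# Obtained minus predicted, in new units.
DISCREPANCY = {interval: round(obtained - predicted, 1)
               for interval, (predicted, obtained) in REPORTED.items()}
```

On the reported figures the obtained bisection lies 1.8 new units above the predicted one for both intervals.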


SECTION F. SUMMARY AND CONCLUSIONS 


1) The problem was to construct an 
equal unit scale for the subjective diffi- 
culty of multiple choice vocabulary 
items. 

2) The scale was constructed, but it 
possesses certain shortcomings. The as- 
sumption that the order of the items 
with respect to subjective difficulty was 
the same as the order with respect to 
percentage passing broke down at the 
upper end of the function. The correct 
way of constructing the function with- 
out making this assumption is described. 

3) The magnitude function was
checked by the method of equal appear- 
ing intervals. The results of this opera- 
tion did not agree perfectly with the 
magnitude function but the margin of 
error was not great. 





GENERAL SUMMARY


THE PURPOSE of this study was to ex-
amine the possibility of measuring
all those discriminable characteristics
that exist in discriminable degrees.

The logical criteria for measurement
were critically examined and the psy-
chological scaling methods were analyzed
in the light of these criteria.

The conclusion was drawn that none
of the attempts at measurement used so
far by psychologists meets the necessary
criteria for fundamental measurement.
It was argued that the equal unit scale
meets more of the necessary criteria than
any other set of operations. The further
conclusion was drawn that there is no
requirement for measurement of any
type that psychology in principle cannot
meet, since the criterion of physical juxta-
position is not a requisite for addition.

Equal unit scales were constructed for 
three discriminable characteristics that 
differed in important respects from 
each other and that presented special 
theoretical and practical problems. It is 
concluded that it is possible to construct 
equal unit scales, i.e., scales that meet
the logical requirements for order and
equality of units, for discriminable char-
acteristics that are both impalpable and
without a known stimulus correlate.